SPLICE VARIANTS OF ONCOGENES

FIELD OF THE INVENTION 

The present invention concerns novel nucleic acid sequences, vectors and 
host cells containing them, amino acid sequences encoded by said sequences, and 
antibodies reactive with said amino acid sequences, as well as pharmaceutical 
compositions comprising any of the above. The present invention further concerns
methods for screening for candidate activators or deactivators utilizing said amino 
acid sequences. The invention also concerns diagnostic assays utilizing said 
sequences. 

BACKGROUND OF THE INVENTION 

Many genes which are involved with tumors are involved with functions
which encourage and promote growth and division of cells. Some tumor-involved
genes are expressed only in tumor cells, while others are expressed also in normal
cells, albeit at lower levels. A specific example of tumor-involved genes are
oncogenes, which are mutated forms of proto-oncogenes.

Generally, proto-oncogenes code for cellular proteins that relay signals to
the cell's nucleus, thus stimulating growth. These cellular proteins respond to
signals from other cells, and the signaling process involves several steps, among
them binding of growth- and proliferation-regulating factors to the cell membrane,
release of second messengers and a host of other intermediates in the cell
cytoplasm, and activation in the nucleus of transcription factors which move the
cells through their growth cycles.

Proto-oncogenes that code for these various components in the cascade
may mutate, thus becoming oncogenes that keep the pathways continuously
active regardless of the extracellular signals received by the cell. This may result
in over-production of growth factors, flooding of the cell with replication signals,
uncontrolled stimulation of the intermediary pathways and unrestrained cell
growth driven by elevated levels of transcription factors.

The activation of a proto-oncogene to express its oncogenic potential may
occur due to point mutation, chromosome rearrangement, gene amplification (an
increase in the number of copies of normal proto-oncogenes within a cell) and
viral insertion resulting in the control of the expression of the proto-oncogene by
a more active promoter.

Typically, oncogenes exhibit a dominant phenotype at the cellular level, i.e.
one copy of an activated oncogene is sufficient to produce its oncogenic effect, a
phenomenon which is termed "gain of function". There is usually a requirement to
have more than one mutation in the proto-oncogene in order to change a normal
cell line into neoplasia. The oncogene may be transmitted from generation to
generation when a proto-oncogene mutates in the germ line, and since, as
indicated above, usually more than one mutation is required, a single mutation
results in a dominantly inherited tumor predisposition.

The detection of oncogenes is of major importance in the detection of
tumors as well as in the detection of predisposition to a specific kind of tumor,
which may result from additional mutations in an already mutated proto-oncogene.
Oncogenes are detected by a plurality of methods, among them PCR
amplification, hybridization, as well as detection of the oncogenic product by
various immunoassays. The understanding of the site of activity of the oncogene
is of course of major importance in designing a suitable therapeutic
model for the treatment of the cancer resulting from the activity of said oncogene.

Alternative splicing (AS) is an important regulatory mechanism in higher
eukaryotes (P. A. Sharp, Cell 77, 805-815 (1994)). It is thought to be one of the
most important mechanisms for differential expression related to tissue or
development stage specificity. AS also influences protein stability, protein
clearance, and tissue and cellular localization. AS may further alter protein
function by increasing or decreasing the functionality, and may further affect
post-translational modifications. It is known to play a major role in numerous
biological systems, including human antibody responses and sex determination
in Drosophila (S. Stamm, M.Q. Zhang, T.G. Marr and D.M. Helfman, Nucleic
Acids Research 22, 1515-1526 (1994); B. Chabot, Trends Genet. 12, 472-478
(1996); R.E. Breitbart, A. Andreadis, B. Nadal-Ginard, Annu. Rev. Biochem.
56, 467-495 (1987); C.W. Smith, J.G. Patton, B. Nadal-Ginard, Annu. Rev.
Genet. 27, 527-577 (1989)).

Until recently it was commonly believed that alternative splicing existed in
only a small fraction of genes (about 5%). A recent observation based on a
literature survey of known genes revises this conservative estimate upward, to
at least 30% of human genes being alternatively spliced (M.S. Gelfand,
I. Dubchak, I. Draluk and M. Zorn, Nucleic Acids Research 27, 301-302
(1999)). The importance of the actual frequency of this phenomenon lies not only
in the direct impact on the number of proteins created (100,000 human genes, for
example, would be translated to a much higher number of proteins), but also in
the diversity of functionality derived from the process.

Several mechanisms at different stages may be held responsible for the
complexity of higher eukaryotes. Those characterized to date include alternative
splicing at the transcription level, RNA editing at the post-transcriptional
level, and post-translational modifications.

GLOSSARY 

In the following description and claims use will be made, at times, of a
variety of terms, and the meaning of such terms as they should be construed in
accordance with the invention is as follows:

"Tumor-involved genes (TIG)" - genes for which there is some scientific 
indication linking their function, expression, or change in the level of their
expression to tumors. This term does not signify necessarily that the genes cause the
tumor (although in some cases this is so) but may also indicate that the genes are a 
result of the tumor process, for example, they are activated by other genes which 
are the cause of the tumor. 

"Variant nucleic acid sequence" - the sequence shown in any one of SEQ ID
NO: 1 to SEQ ID NO: 34, sequences having at least 90% identity (see below) to
said sequence and fragments (see below) of the above sequences of at least 20 b.p.
long. These sequences are sequences coding for novel, naturally occurring,
alternative splice variants of native and known genes which are tumor-involved
genes (TIG). It should be emphasized that the novel variants of the present
invention are naturally occurring sequences resulting from alternative splicing of
the TIGs and not merely truncated, mutated or fragmented forms of known
tumor-involved sequences which are artificially produced.

"Variant product" - also referred to at times as the "variant protein" or "variant
polypeptide" - is an amino acid sequence encoded by the variant nucleic acid
sequence which is a naturally occurring mRNA sequence obtained as a result of
alternative splicing. The amino acid sequence may be a peptide, a protein, as well
as peptides or proteins having chemically modified amino acids (see below) such as
a glycopeptide or glycoprotein. The variant products are shown in any one of SEQ
ID NO: 35 to SEQ ID NO: 68. The term also includes homologues (see below) of
said sequences in which one or more amino acids has been added, deleted,
substituted (see below) or chemically modified (see below) as well as fragments
(see below) of this sequence having at least 10 amino acids.

"Nucleic acid sequence" - a sequence composed of DNA nucleotides, RNA 
nucleotides or a combination of both types and may include natural nucleotides,
chemically modified nucleotides and synthetic nucleotides. 

"Amino acid sequence" - a sequence composed of any one of the 20 naturally 
appearing amino acids, amino acids which have been chemically modified (see 
below), or composed of synthetic amino acids.

"Fragment of variant nucleic acid sequence" - novel short stretch of nucleic
acid sequences of at least 20 b.p., which does not appear as a continuous stretch
in the original nucleic acid sequence (see below). The fragment may be a
sequence which was previously undescribed in the context of the published RNA
and which affects the amino acid sequence encoded by the known oncogene. For
example, where the variant nucleic acid includes a sequence which was not included
in the original sequence of the oncogene (for example a sequence which was an
intron in the original sequence) the fragment may contain said additional
sequence. The fragment may also be a region which is not an intron, which was
not present in the original sequence of the TIG, for example where the variant
lacks a non-terminal region which was present in the original sequence of the
TIG. The two stretches of nucleotides spanning this region (upstream and
downstream) are brought together by splicing in the variant, but are spaced from
each other by the spliced-out region in the original sequence of the TIG and are
thus not continuous in the original sequence. A continuous stretch of nucleic acids
comprising said two spanning stretches of nucleotides is not present in the original
sequence of the TIG and thus falls under the definition of fragment.

"Fragments of variant products" - novel amino acid sequences coded by the
"fragment of variant nucleic acid sequence" defined above.

"Homologues of variants" - amino acid sequences of variants in which one or
more amino acids has been added, deleted or replaced. The addition, deletion or
replacement should be in the regions or adjacent to regions where the variant
differs from the original sequence (see below) of the TIG.

"Conservative substitution" - refers to the substitution of an amino acid in one
class by an amino acid of the same class, where a class is defined by common
physicochemical amino acid side chain properties and high substitution
frequencies in homologous proteins found in nature, as determined, for example,
by a standard Dayhoff frequency exchange matrix or BLOSUM matrix. Six
general classes of amino acid side chains have been categorized and include:
Class I (Cys); Class II (Ser, Thr, Pro, Ala, Gly); Class III (Asn, Asp, Gln, Glu);
Class IV (His, Arg, Lys); Class V (Ile, Leu, Val, Met); and Class VI (Phe, Tyr,
Trp). For example, substitution of an Asp for another class III residue such as
Asn, Gln, or Glu, is a conservative substitution.

"Non-conservative substitution" - refers to the substitution of an amino acid in
one class with an amino acid from another class; for example, substitution of an
Ala, a class II residue, with a class III residue such as Asp, Asn, Glu, or Gln.
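
By way of non-limiting illustration, the class-based test described above may be sketched in Python as follows. The names CLASSES, RESIDUE_CLASS and classify_substitution are hypothetical and are chosen for this sketch only; the class membership is taken from the six classes listed above, written with the standard three-letter codes (Gln, Ile).

```python
# Side-chain classes as listed in the definition above (three-letter codes).
CLASSES = {
    "I": {"Cys"},
    "II": {"Ser", "Thr", "Pro", "Ala", "Gly"},
    "III": {"Asn", "Asp", "Gln", "Glu"},
    "IV": {"His", "Arg", "Lys"},
    "V": {"Ile", "Leu", "Val", "Met"},
    "VI": {"Phe", "Tyr", "Trp"},
}

# Reverse lookup: residue -> class label.
RESIDUE_CLASS = {res: cls for cls, members in CLASSES.items() for res in members}


def classify_substitution(original, replacement):
    """Label a single-residue substitution using the six classes above."""
    if original == replacement:
        return "identical"
    if RESIDUE_CLASS[original] == RESIDUE_CLASS[replacement]:
        return "conservative"
    return "non-conservative"


print(classify_substitution("Asp", "Asn"))  # both class III
print(classify_substitution("Ala", "Asp"))  # class II versus class III
```

The sketch deliberately uses a fixed class table rather than a Dayhoff or BLOSUM matrix, since the definition above reduces the matrix-derived groupings to these six classes.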

"Chemically modified" - when referring to the product of the invention, means a 
product (protein) where at least one of its amino acid residues is modified either by
natural processes, such as processing or other post-translational modifications, or
by chemical modification techniques which are well known in the art. Among
the numerous known modifications, typical but not exclusive examples include:
acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor
formation, covalent attachment of a lipid or lipid derivative, methylation,
myristoylation, pegylation, prenylation, phosphorylation, ubiquitination, or any
similar process.

"Biologically active" - refers to the variant product having some sort of 
biological activity, for example, some physiologically measurable effect on target 
cells, molecules or tissues. 

"Immunologically active" defines the capability of a natural, recombinant or
synthetic variant product, or any fragment thereof, to induce a specific immune
response in appropriate animals or cells and to bind with specific antibodies.
Thus, for example, an immunologically active fragment of variant product
denotes a fragment which retains some or all of the immunological properties of
the variant product, e.g. can bind specific anti-variant product antibodies or
can elicit an immune response which will generate such antibodies or cause
proliferation of specific immune cells which produce variant.

"Optimal alignment" - is defined as an alignment giving the highest percent
identity score. Such alignment can be performed using a variety of commercially
available sequence analysis programs, such as the local alignment program
LALIGN using a ktup of 1, default parameters and the default PAM. A preferred
alignment is the one performed using the CLUSTAL-W program from
MacVector (TM), operated with an open gap penalty of 10.0, an extended gap
penalty of 0.1, and a BLOSUM similarity matrix. If a gap needs to be inserted
into a first sequence to optimally align it with a second sequence, the percent
identity is calculated using only the residues that are paired with a corresponding
amino acid residue (i.e., the calculation does not consider residues in the second
sequence that are in the "gap" of the first sequence). In case of alignments of
known gene sequences with that of the new variant, the optimal alignment
invariably included aligning the identical parts of both sequences together, then
keeping apart and unaligned the sections of the sequences that differ one from the
other.

"Having at least 90% identity" - with respect to two amino acid or nucleic acid
sequences, refers to the percentage of residues that are identical in the two
sequences when the sequences are optimally aligned. Thus, 90% amino acid
sequence identity means that 90% of the amino acids in two or more optimally
aligned polypeptide sequences are identical; however, this definition explicitly
excludes sequences which are 100% identical with the original sequence from
which the variant of the invention was varied.
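
The percent identity calculation defined above may be illustrated, purely as a sketch, by the following Python code. It assumes that the two sequences have already been optimally aligned by an external program and that gaps are written as "-"; the function names percent_identity and meets_identity_criterion, and the toy sequences, are hypothetical and serve only for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over paired (non-gap) columns of a pre-computed alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where a residue is paired with a residue.
    paired = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap]
    if not paired:
        return 0.0
    matches = sum(1 for a, b in paired if a == b)
    return 100.0 * matches / len(paired)


def meets_identity_criterion(aligned_candidate, aligned_variant, original, threshold=90.0):
    """Apply the threshold, excluding candidates identical to the original sequence."""
    candidate = aligned_candidate.replace("-", "")
    if candidate == original:
        return False
    return percent_identity(aligned_candidate, aligned_variant) >= threshold


# Toy alignment: ten paired columns, nine of which are identical.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```

Columns containing a gap in either sequence are left out of both the numerator and the denominator, which is one reading of the gap rule stated in the definition of "optimal alignment".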



"Isolated nucleic acid molecule having a variant nucleic acid sequence" - is a
nucleic acid molecule that includes the coding variant nucleic acid sequence. Said
isolated nucleic acid molecule may include the variant nucleic acid sequence as
an independent insert; may include the variant nucleic acid sequence fused to an
additional coding sequence, encoding together a fusion protein in which the
variant coding sequence is the dominant coding sequence (for example, the
additional coding sequence may code for a signal peptide); may include the
variant nucleic acid sequence in combination with non-coding sequences, e.g.,
introns or control elements, such as promoter and terminator elements or 5' and/or
3' untranslated regions, effective for expression of the coding sequence in a
suitable host; or may be a vector in which the variant protein coding sequence is
heterologous.

"Expression vector" - refers to vectors that have the ability to incorporate and
express heterologous DNA fragments in a foreign cell. Many prokaryotic and
eukaryotic expression vectors are known and/or commercially available.
Selection of appropriate expression vectors is within the knowledge of those
having skill in the art.

"Deletion" - is a change in either nucleotide or amino acid sequence in which 
one or more nucleotides or amino acid residues, respectively, are absent. 

"Insertion" or "addition" - is that change in a nucleotide or amino acid 
sequence which has resulted in the addition of one or more nucleotides or amino 
acid residues, respectively, as compared to the naturally occurring sequence. 

"Substitution" - replacement of one or more nucleotides or amino acids by
different nucleotides or amino acids, respectively. As regards amino acid
sequences the substitution may be conservative or non-conservative.

"Antibody" - refers to IgG, IgM, IgD, IgA, or IgE antibody. The definition
includes polyclonal antibodies or monoclonal antibodies. This term refers to
whole antibodies or fragments of the antibodies comprising the antigen-binding
domain of the anti-variant product antibodies, e.g. antibodies without the Fc
portion, single chain antibodies, fragments consisting of essentially only the
variable, antigen-binding domain of the antibody, etc.

"Distinguishing antibody" - an antibody capable of binding to the variant product
and not the original amino acid sequence of the tumor-involved gene from which 
it has been varied, or an antibody capable of binding to the original nucleic acid 
sequence and not to the variant product. 

"Activator" - as used herein, refers to a molecule which mimics the effect of the
natural variant product or at times even increases or prolongs the duration of the
biological activity of said product, as compared to that induced by the variant
product. The mechanism may be any mechanism known to prolong the
activities of biological molecules, such as binding to receptors; prolonging the
lifetime of the molecules; increasing the activity of the molecules on their target;
increasing the affinity of the molecules to their receptor; inhibiting degradation or
proteolysis of the molecules; or mimicking the biological activity of the variants
on their targets, etc. Activators may be polypeptides, nucleic acids,
carbohydrates, lipids, or derivatives thereof, or any other molecules which can
bind to and activate the variant product.

"Deactivator" (or "Inhibitor") - refers to a molecule which modulates the
activity of the variant product in an opposite manner to that of the activator, by
decreasing or shortening the duration of the biological activity of the variant
product. This may be done by any mechanism known to deactivate or inhibit
biological molecules such as blocking of the receptor, blocking of the active site,
competition for the binding site in the target, enhancement of degradation, etc.
Deactivators may be polypeptides, nucleic acids, carbohydrates, lipids, or
derivatives thereof, or any other molecules which bind to and modulate the
activity of said product.

"Treating a disease" - refers to administering a therapeutic substance effective
to ameliorate symptoms associated with a disease, to lessen the severity or cure
the disease, or to prevent the disease from occurring. Typically the disease is
cancer.

"Detection" - refers to a method of detection of a cancer. This term may refer to 
detection of a predisposition to cancer as well as for establishing the prognosis of 
the patient by determining the severity of the disease, i.e. determining in which 
stage the cancer is. 

"Probe" - the variant nucleic acid sequence, or a sequence complementary 
therewith, when used to detect presence of other similar sequences in a sample. 
The detection is carried out by identification of hybridization complexes between 
the probe and the assayed sequence. The probe may be attached to a solid support 
or to a detectable label.

"Original sequence" - the amino acid or nucleic acid sequence of the tumor-
involved gene (TIG) from which the variant of the invention has been varied as
a result of alternative splicing. This sequence will also be referred to at times as
"tumor-involved-gene" (TIG).

SUMMARY OF THE INVENTION 

The present invention is based on the finding of several novel, naturally
occurring splice variants, which are naturally occurring sequences obtained by
alternative splicing of known genes whose expression was reported in the scientific
literature to be involved with tumors (hereinafter "tumor-involved genes" or
"TIGs"). The above term does not signify that the genes necessarily caused the
tumor (although this may be so), merely that they are involved therewith (i.e.
expressed in tumors) and this expression may be the result of other effects, for
example, as a result of expression of other genes. The novel splice variants of the
invention are not merely truncated forms, fragments or mutations of the known
tumor-involved genes, but rather novel sequences which naturally occur within the
body of individuals, and thus have physiological significance.

The term "alternative splicing" in the context of the present invention and
claims refers to: intron inclusion, exon exclusion, addition or deletion of terminal
sequences in the variant as compared to the original sequences, as well as to the
possibility of "intron retention". Intron retention is an intermediate stage in the
processing of RNA transcripts, where prior to production of fully processed mRNA
the intron (naturally spliced out in the original TIG sequence) is retained in the
variant. These intermediately processed RNAs may have physiological significance
and are also within the scope of the invention.
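
As a schematic illustration only, the difference between the original transcript, an exon-exclusion variant and an intron-retention variant may be sketched in Python as follows. The segment sequences, the list name SEGMENTS and the function name splice are hypothetical toy values for this sketch and do not correspond to any sequence of the invention.

```python
# A toy pre-mRNA: alternating exons (upper case) and introns (lower case).
SEGMENTS = [
    ("exon", "AAA"),
    ("intron", "gtcag"),
    ("exon", "CCC"),
    ("intron", "gtaag"),
    ("exon", "GGG"),
]


def splice(segments, skip_exons=(), retain_introns=()):
    """Join segments into a transcript; exons and introns are numbered from 1."""
    transcript = []
    exon_number = intron_number = 0
    for kind, sequence in segments:
        if kind == "exon":
            exon_number += 1
            if exon_number not in skip_exons:
                transcript.append(sequence)
        else:
            intron_number += 1
            if intron_number in retain_introns:
                transcript.append(sequence)
    return "".join(transcript)


print(splice(SEGMENTS))                      # original: all exons, no introns
print(splice(SEGMENTS, skip_exons={2}))      # exon exclusion
print(splice(SEGMENTS, retain_introns={1}))  # intron retention
```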

The novel variant products of the invention may have the same
physiological activity as the original tumor-involved peptide from which they have
been varied (although perhaps at a different level); may have an opposite
physiological activity from the activity featured by the original tumor-involved
peptide from which they are varied; may have a completely different, unrelated
activity to the activity of the original tumor-involved peptide from which they are
varied; or alternatively may have no activity at all, and this may lead to various
diseases or pathological conditions, especially cancer. Both in the case where the
variant has the same activity and in the case where it has the opposite activity to
the original TIG sequence, it may differ from the TIG in its stability, its clearance,
its tissue and cellular localization and in other biological properties not
necessarily connected to activity.

The novel variants may also serve for detection purposes, i.e. their presence
or level may be indicative of cancer, a predisposition to cancer or the stage and
aggressiveness of the cancer disease, or alternatively the ratio between the level
of the variants and the level of the original peptide from which they were varied,
or the ratio to other variants (all obtained by alternative splicing from the same
original sequence of the tumor-involved gene) may be indicative of the presence of
cancer, predisposition to cancer or the stage and aggressiveness of the cancer
disease.

For example, for detection purposes, it is possible to establish differential
expression of various variants in various tissues. A certain variant may be
expressed mainly in one tissue, while the original sequence (tumor-involved
sequence) from which it has been varied, or another variant (obtained by alternative
splicing from the same original tumor-involved sequence) may be expressed
mainly in another tissue. Understanding of the distribution of the variants in various
tissues may be helpful in basic research, for understanding the physiological
function of the original tumor-involved genes from which they have been varied, as
well as help in targeting pharmaceuticals or in developing pharmaceuticals, and in
establishing more accurate modalities of diagnosis.

The study of the variants may also be helpful in distinguishing various
stages in the life cycles of the same type of cells, which may also be helpful for
development of pharmaceuticals for various cancer stages in which the cell cycle
is abnormal.

Thus the detection may be by determination of the presence or the level of
expression of the variant within a specific cell population, comparing said presence
or level between various cell types in a tissue, between different tissues and
between individuals.

Thus the present invention provides by its first aspect, a novel isolated
nucleic acid molecule comprising or consisting of any one of the coding sequences
SEQ ID NO: 1 to SEQ ID NO: 34, fragments of said coding sequence having at
least 20 nucleotides (provided that said fragments are continuous stretches of
nucleotides not present in the original sequence from which the variant was varied),
or a molecule comprising a sequence having at least 90% identity to SEQ ID
NO: 1 to SEQ ID NO: 34, provided that the molecule is not completely identical to
the original sequence of the tumor-involved gene from which the variant was
varied.
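
The proviso on fragments may be illustrated, as a sketch only, by the following Python code for the case of an excluded non-terminal region. The three toy segments and the names is_novel_fragment, UPSTREAM, SKIPPED and DOWNSTREAM are hypothetical and are not sequences of the invention; the check simply tests that a stretch of at least 20 nucleotides is continuous in the variant but not in the original sequence.

```python
# Hypothetical toy segments (20 nucleotides each), for illustration only.
UPSTREAM = "ATGGCTGAGCTGACCAAGGT"
SKIPPED = "CCCTTTAAAGGGCCCTTTAA"   # non-terminal region absent from the variant
DOWNSTREAM = "GATCTGAACGTGCTGAAGCA"

ORIGINAL = UPSTREAM + SKIPPED + DOWNSTREAM
VARIANT = UPSTREAM + DOWNSTREAM


def is_novel_fragment(fragment, variant, original, min_len=20):
    """True if the fragment is continuous in the variant but not in the original."""
    return (
        len(fragment) >= min_len
        and fragment in variant
        and fragment not in original
    )


# A stretch spanning the new splice junction: 10 nt upstream + 10 nt downstream.
junction = UPSTREAM[-10:] + DOWNSTREAM[:10]

print(is_novel_fragment(junction, VARIANT, ORIGINAL))  # spans the junction
print(is_novel_fragment(UPSTREAM, VARIANT, ORIGINAL))  # also present in original
```

A stretch lying wholly within a region shared by the variant and the original sequence fails the check, whereas a stretch spanning the junction created by splicing passes it.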

The present invention further provides a protein or polypeptide comprising
or consisting of an amino acid sequence encoded by any of the above nucleic acid
sequences, termed herein "variant product", for example, an amino acid sequence
having the sequence as depicted in any one of SEQ ID NO: 35 to SEQ ID NO: 68,
fragments of the above amino acid sequence having a length of at least 10 amino
acids coded by the above fragments of the nucleic acid sequences, as well as
homologues of the above amino acid sequences in which one or more of the amino
acid residues has been substituted (by conservative or non-conservative
substitution), added, deleted, or chemically modified.

The deletions, insertions and modifications should be in regions, or adjacent
to regions, wherein the variant differs from the original sequence of the
tumor-involved gene.

For example, where the variant is different from the original sequence of the
tumor-involved gene by addition of a short stretch of 10 amino acids in the
terminal or non-terminal portion of the peptide, i.e. inclusion of an exon, the
invention also concerns homologues of that variant where the additional short
stretch is altered; for example, it includes only 8 additional amino acids, includes
13 additional amino acids, or includes 10 additional amino acids, some of
them being conservative or non-conservative substitutes of the original additional
10 amino acids of the novel variants. In all cases the changes in the homolog, as
compared to the original tumor-involved sequence, are in the same regions where
the variant differs from the original sequence, or in regions adjacent to said region.

Another example is where the variant lacks a non-terminal region (for
example of 20 amino acids) which is present in the original tumor-involved
sequence (due for example to exon exclusion). The homologues may lack in the
same region only 17 amino acids or 23 amino acids. Again the deletion is in the
same region where the variant lacks a sequence as compared to the original
tumor-involved sequence, or in a region adjacent thereto. It should be appreciated
that once the attention of a person versed in the art is directed to the importance
of a specific region, due to the fact that this region differs in the variant as
compared to the original sequence of the tumor-involved gene, there is no
difficulty in modifying said specific region by adding to it, deleting from it, or
substituting some amino acids in it. Thus homologues of variants which are derived
from the variant by changes (deletion, addition, substitution) only in said region
as well as in regions adjacent to it are also a part of the present invention.
Generally, if the variant is distinguished from the original sequence of the
tumor-involved gene by some sort of physiological activity, then the homolog is
distinguished from the original tumor-involved sequence in essentially the same
manner.

The present invention further provides a nucleic acid molecule comprising or
consisting of a sequence which encodes the above amino acid sequences
(including the fragments and homologues of the amino acid sequences). Due to the
degenerate nature of the genetic code, a plurality of alternative nucleic acid
sequences, beyond those depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 34,
can code for the amino acid sequences of the invention. Those alternative nucleic
acid sequences which code for the same amino acid sequences coded by the
sequences SEQ ID NO: 1 to SEQ ID NO: 34 are also an aspect of the present
invention.
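
The degeneracy referred to above may be illustrated, as a sketch only, by the following Python code, which builds the standard genetic code and shows that two different nucleotide sequences encode the same peptide. The function names translate and count_synonymous_sequences and the short toy sequences are hypothetical; the standard (non-mitochondrial) genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence (one-letter amino acids), stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def count_synonymous_sequences(protein):
    """Number of distinct coding sequences that encode the given peptide."""
    count = 1
    for aa in protein:
        count *= sum(1 for value in CODON_TABLE.values() if value == aa)
    return count


# Two different toy sequences encoding the same tripeptide Met-Ala-Leu.
print(translate("ATGGCTTTA"), translate("ATGGCCCTG"))
print(count_synonymous_sequences("MAL"))
```

Because the count multiplies across residues, even a short peptide is encoded by many alternative nucleic acid sequences.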

The present invention further provides expression vectors and cloning 
vectors comprising any of the above nucleic acid sequences, as well as host cells 
transfected by said vectors.

The present invention still further provides pharmaceutical compositions 
comprising, as an active ingredient, said nucleic acid molecules, said expression 
vectors, or said protein or polypeptide. 

These pharmaceutical compositions are suitable for the treatment of various 
cancers, which can be ameliorated or cured by raising the level of any one of the
variant products of the invention. 

By a second aspect, the present invention provides a nucleic acid molecule
comprising or consisting of a non-coding sequence which is complementary to that
of any one of SEQ ID NO: 1 to SEQ ID NO: 34, or complementary to a sequence
having at least 90% identity to said sequence (with the proviso added above) or a
fragment of said two sequences (according to the above definition of fragment).
The complementary sequence may be a DNA sequence which hybridizes with any
one of SEQ ID NO: 1 to SEQ ID NO: 34 or hybridizes to a portion of that
sequence having a length sufficient to inhibit the transcription of the
complementary sequence. The complementary sequence may be a DNA sequence
which can be transcribed into an mRNA being an antisense to the mRNA
transcribed from any one of SEQ ID NO: 1 to SEQ ID NO: 34 or into an mRNA
which is an antisense to a fragment of the mRNA transcribed from any one of SEQ
ID NO: 1 to SEQ ID NO: 34 which has a length sufficient to hybridize with the
mRNA transcribed from SEQ ID NO: 1 to SEQ ID NO: 34, so as to inhibit its
translation. The complementary sequence may also be the mRNA or the fragment
of the mRNA itself.
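
The notion of an antisense sequence may be illustrated, as a sketch only, by the following Python code, which derives the reverse complement of an RNA stretch and checks perfect base-pairing against a target. The function names antisense_rna and can_hybridize and the toy sequences are hypothetical; real hybridization also depends on length, mismatches and conditions, which this sketch does not model.

```python
# Watson-Crick pairing for RNA bases.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(rna):
    """Reverse complement of an RNA sequence written 5' to 3'."""
    return rna.translate(RNA_COMPLEMENT)[::-1]


def can_hybridize(antisense, target_mrna):
    """True if the antisense is perfectly complementary to a stretch of the target."""
    return antisense_rna(antisense) in target_mrna


probe = antisense_rna("AUGGCU")
print(probe)
print(can_hybridize(probe, "GGAUGGCUAA"))
```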

The nucleic acids of the second aspect of the invention may be used for
therapeutic or diagnostic applications, for example as probes used for the detection
of the variants of the invention. The presence of the variant transcript or the level of
the variant transcript may be indicative of cancer, predisposition to cancer or the
stage or aggressiveness of the cancer disease. In addition or alternatively, the level
of the transcripts of the variants of the invention may also be compared
to that of the transcripts of the original sequences of the oncogenes from which
they have been varied, or to the level of transcript of other variants (especially
those obtained by alternative splicing from the same original sequence), and the
resulting ratio may be indicative of cancer, predisposition to cancer or the stage
or aggressiveness of the cancer disease.

The present invention also provides expression vectors comprising any one 
of the above defined complementary nucleic acid sequences and host cells
transfected with said nucleic acid sequences or vectors, being complementary to 
those specified in the first aspect of the invention. 

The invention also provides anti-variant product antibodies, namely 
antibodies directed against the variant product which specifically bind to said 
variant product. Said antibodies are useful both for diagnostic and therapeutic
purposes. For example said antibody may serve as an active ingredient in a
pharmaceutical composition as will be explained below.

The present invention also provides pharmaceutical compositions
comprising, as an active ingredient, the nucleic acid molecules which comprise or
consist of said complementary sequences, or of a vector comprising said
complementary sequences. The invention further provides
pharmaceutical compositions comprising, as an active ingredient, said anti-variant
product antibodies.

The pharmaceutical compositions comprising said anti-variant product
antibodies or the nucleic acid molecule comprising said complementary sequence
are suitable for the treatment of diseases and pathological conditions where a
therapeutically beneficial effect may be achieved by neutralizing the variant (either
at the transcript or product level) or decreasing the amount of the variant product or
blocking its binding to its target, for example, by the neutralizing effect of the
antibodies, or by the effect of the antisense mRNA in decreasing the expression
level of the variant sequence. In particular these diseases are cancer diseases, and
the treatment may also be for the amelioration of cancer or for the prevention of
cancer.

According to the third aspect of the invention, the present invention provides
methods for detecting the level of the transcript (mRNA) of said variant product in
a body fluid sample, or in a specific tissue sample, for example by use of probes
comprising or consisting of said coding sequences; as well as methods for detecting
levels of expression of said product in tissue, e.g. by the use of antibodies capable
of specifically reacting with the variant products of the invention. Detection of the
level of the expression of the variant of the invention, in particular as compared to
that of the original tumor-involved gene (TIG) sequence from which it was varied or
compared to other variant sequences all varied from the same original TIG
sequence, may be indicative of cancer, predisposition to cancer or the stage or
aggressiveness of the cancer disease.

The method, according to this latter aspect, for detection of a nucleic acid
sequence which encodes the variant product in a biological sample, comprises the
steps of:

(a) providing a probe comprising at least one of the nucleic acid
sequences defined above;

(b) contacting the biological sample with said probe under conditions
allowing hybridization of nucleic acid sequences thereby enabling formation of
hybridization complexes;

(c) detecting hybridization complexes, wherein the presence of the
complexes indicates the presence of nucleic acid sequence encoding the variant
product in the biological sample.

The method as described above is qualitative, i.e. it indicates whether the
transcript is present in or absent from the sample. The method can also be
quantitative, by determining the level of hybridization complexes and then
calibrating said levels to determine the levels of transcripts of the desired variant in
the sample.
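
One way to picture the calibration step is a standard curve fitted to samples of known transcript amount. The sketch below assumes a linear relationship between hybridization signal and transcript amount, which the text does not specify; the function names and the numbers in the usage note are hypothetical.

```python
def fit_standard_curve(known_amounts, signals):
    """Least-squares fit of signal = slope * amount + intercept from calibration standards."""
    n = len(known_amounts)
    if n != len(signals) or n < 2:
        raise ValueError("need at least two (amount, signal) calibration pairs")
    mean_x = sum(known_amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in known_amounts)
    if sxx == 0:
        raise ValueError("calibration amounts must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(known_amounts, signals))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def signal_to_amount(signal, slope, intercept):
    """Invert the standard curve to estimate the transcript amount for a measured signal."""
    if slope == 0:
        raise ValueError("flat standard curve; signal carries no information about amount")
    return (signal - intercept) / slope
```

With hypothetical standards of 0, 1, 2 and 4 units giving signals of 5, 15, 25 and 45, the fitted curve has a slope of 10 and an intercept of 5, so a measured signal of 35 corresponds to about 3 units of transcript.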

Both qualitative and quantitative determination methods can be used for
diagnostic, prognostic and therapy planning purposes, especially in conjunction
with cancer diseases. In addition, qualitative determination may be indicative of the
cancer stage.

By a preferred embodiment the probe is part of a nucleic acid chip used for 
detection purposes, i.e. the probe is a part of an array of probes each present in a 
known location on a solid support. 

The nucleic acid sequence used in the above method may be a DNA
sequence, an RNA sequence, etc.; it may be a coding sequence or a sequence
complementary thereto (for respective detection of RNA transcripts or
coding-DNA sequences). By quantification of the level of hybridization complexes
and calibration of the quantified results it is also possible to detect the level of the
transcript in the sample.

Methods for detecting mutations in the region coding for the variant product
are also provided, which may be methods carried out in a binary fashion, namely
merely detecting whether there are any mismatches between the normal variant
nucleic acid sequence of the invention and the one present in the sample, or
carried out by specifically detecting the nature and location of the mutation.
Detection of mutations may be of importance in the determination of predisposition
to cancer, as well as in attempts to establish the prognosis of the cancer disease.
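
The two modes of mutation detection described above (binary, and by nature and location) can be sketched for the simplest case. The sketch assumes the reference and sample sequences are already aligned and of equal length, so only substitutions are reported; the function names and the toy sequences in the usage note are hypothetical.

```python
def find_mismatches(reference: str, sample: str):
    """List substitutions as (1-based position, reference base, sample base)."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned to equal length")
    pairs = zip(reference.upper(), sample.upper())
    return [(i + 1, r, s) for i, (r, s) in enumerate(pairs) if r != s]


def has_mutation(reference: str, sample: str) -> bool:
    """Binary mode: report only whether any mismatch exists."""
    return bool(find_mismatches(reference, sample))
```

For the hypothetical pair "ATGCCA" and "ATGTCA", `has_mutation` returns True and `find_mismatches` returns one entry, a C to T substitution at position 4. Insertions and deletions would require an alignment step that this sketch omits.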

The present invention also concerns a method for detecting variant product
in a biological sample, comprising the steps of:

(a) contacting with said biological sample the antibody of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex

wherein the presence of said antibody-antigen complex correlates with the
presence of variant product in said biological sample.

Many diseases are diagnosed by detecting the presence of antibodies against
a protein characterizing the disease in the blood, serum or any other body fluid of
the patient. The present invention also concerns a method for detecting anti-variant
antibody in a biological sample, comprising:

(a) contacting said sample with the variant product of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex

wherein the presence of said antibody-antigen complex correlates with the
presence of anti-variant antibody in the sample.

As indicated above, both methods (for detection of variant product and for
detection of the anti-variant antibody) can be quantified to determine the level or
the amount of the variant or antibody in the sample, alone or in comparison to the
level of the original amino acid tumor-involved sequence from which it was varied
or compared to the level of antibodies against the original amino acid sequence,
and qualitative and quantitative results may be used for diagnostic, prognostic and
therapy planning purposes.

The invention also concerns distinguishing antibodies, i.e. antibodies
capable of binding either to the variant product or to the original tumor-involved
gene sequence from which the variant has been varied, while not binding to the
original sequence or the variant product respectively. These distinguishing
antibodies may be used for detection purposes.

By yet another aspect the invention also provides a method for identifying
candidate compounds capable of binding to the variant product and modulating its
activity (being either activators or deactivators). The method includes:

(i) providing a protein or polypeptide comprising an amino acid
sequence substantially as depicted in any one of SEQ ID NO: 35 to 68, or a
fragment of such a sequence;

(ii) contacting a candidate compound with said amino acid sequence;

(iii) measuring the physiological effect of said candidate compound on
the activity of the amino acid sequences and selecting those compounds which
show a significant effect on said physiological activity.

The present invention also concerns compounds identified by the
method described above, which compounds may be either activators of the
variant product or deactivators thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention and to see how it may be carried out in 
practice, a preferred embodiment will now be described, by way of non-limiting 
example only, with reference to the accompanying drawings, in which: 

Fig. 1 is a comparison between the amino acid sequence of SEQ ID NO: 35
and the original tumor-involved sequence from which it has been varied;

Fig. 2 is a comparison between the amino acid sequence of SEQ ID NO: 36
and the original tumor-involved sequence from which it has been varied;

Fig. 3 is a comparison between the amino acid sequence of SEQ ID NO: 37
and the original tumor-involved sequence from which it has been varied;

Fig. 4 is a comparison between the amino acid sequence of SEQ ID NO: 38
and the original tumor-involved sequence from which it has been varied;

Fig. 5 is a comparison between the amino acid sequence of SEQ ID NO: 39
and the original tumor-involved sequence from which it has been varied;

Fig. 6 is a comparison between the amino acid sequence of SEQ ID NO: 40
and the original tumor-involved sequence from which it has been varied;

Fig. 7 is a comparison between the amino acid sequence of SEQ ID NO: 41
and the original tumor-involved sequence from which it has been varied;

Fig. 8 is a comparison between the amino acid sequence of SEQ ID NO: 42
and the original tumor-involved sequence from which it has been varied;

Fig. 9 is a comparison between the amino acid sequence of SEQ ID NO: 43
and the original tumor-involved sequence from which it has been varied;

Fig. 10 is a comparison between the amino acid sequence of SEQ ID
NO: 44 and the original tumor-involved sequence from which it has been varied;

Fig. 11 is a comparison between the amino acid sequence of SEQ ID
NO: 45 and the original tumor-involved sequence from which it has been varied;

Fig. 12 is a comparison between the amino acid sequence of SEQ ID
NO: 46 and the original tumor-involved sequence from which it has been varied;

Fig. 13 is a comparison between the amino acid sequence of SEQ ID
NO: 47 and the original tumor-involved sequence from which it has been varied;

Fig. 14 is a comparison between the amino acid sequence of SEQ ID
NO: 48 and the original tumor-involved sequence from which it has been varied;

Fig. 15 is a comparison between the amino acid sequence of SEQ ID
NO: 49 and the original tumor-involved sequence from which it has been varied;

Fig. 16 is a comparison between the amino acid sequence of SEQ ID
NO: 50 and the original tumor-involved sequence from which it has been varied;

Fig. 17 is a comparison between the amino acid sequence of SEQ ID
NO: 51 and the original tumor-involved sequence from which it has been varied;

Fig. 18 is a comparison between the amino acid sequence of SEQ ID
NO: 52 and the original tumor-involved sequence from which it has been varied;

Fig. 19 is a comparison between the amino acid sequence of SEQ ID
NO: 53 and the original tumor-involved sequence from which it has been varied;

Fig. 20 is a comparison between the amino acid sequence of SEQ ID
NO: 54 and the original tumor-involved sequence from which it has been varied;

Fig. 21 is a comparison between the amino acid sequence of SEQ ID
NO: 55 and the original tumor-involved sequence from which it has been varied;

Fig. 22 is a comparison between the amino acid sequence of SEQ ID
NO: 56 and the original tumor-involved sequence from which it has been varied;

Fig. 23 is a comparison between the amino acid sequence of SEQ ID
NO: 57 and the original tumor-involved sequence from which it has been varied;

Fig. 24 is a comparison between the amino acid sequence of SEQ ID
NO: 58 and the original tumor-involved sequence from which it has been varied;

Fig. 25 is a comparison between the amino acid sequence of SEQ ID
NO: 59 and the original tumor-involved sequence from which it has been varied;

Fig. 26 is a comparison between the amino acid sequence of SEQ ID
NO: 60 and the original tumor-involved sequence from which it has been varied;

Fig. 27 is a comparison between the amino acid sequence of SEQ ID
NO: 61 and the original tumor-involved sequence from which it has been varied;

Fig. 28 is a comparison between the amino acid sequence of SEQ ID
NO: 62 and the original tumor-involved sequence from which it has been varied;

Fig. 29 is a comparison between the amino acid sequence of SEQ ID
NO: 63 and the original tumor-involved sequence from which it has been varied;

Fig. 30 is a comparison between the amino acid sequence of SEQ ID
NO: 64 and the original tumor-involved sequence from which it has been varied;

Fig. 31 is a comparison between the amino acid sequence of SEQ ID
NO: 65 and the original tumor-involved sequence from which it has been varied;

Fig. 32 is a comparison between the amino acid sequence of SEQ ID
NO: 66 and the original tumor-involved sequence from which it has been varied;

Fig. 33 is a comparison between the amino acid sequence of SEQ ID
NO: 67 and the original tumor-involved sequence from which it has been varied;

Fig. 34 is a comparison between the amino acid sequence of SEQ ID
NO: 68 and the original tumor-involved sequence from which it has been varied.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Example I: Comparison of variants with original sequences

Original sequences of tumor-involved genes were obtained from GenBank
Version 115. Their tumor involvement was determined by comparison between
the original sequences and the novel variant sequences, which was made using the
BestFit application from the GCG suite version 10.0 (January 1999), with the
default values:

Gap creation penalty (Gap Weight): 50 

Gap extension penalty (GapLength Weight): 3 
The comparison is shown in Figs. 1 to 34, which show the comparison of
each of the variant products depicted in SEQ ID NO: 35 to 68 with the original
tumor-involved sequence from which it was varied.
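
The effect of the two gap parameters listed above can be illustrated with an affine gap penalty. The sketch assumes that the extension penalty is charged for every position in the gap, including the first; whether BestFit applies exactly this convention is not stated in the text, and the function name is hypothetical.

```python
GAP_CREATION = 50   # Gap Weight, as listed above
GAP_EXTENSION = 3   # GapLength Weight, as listed above


def gap_penalty(length: int, creation: int = GAP_CREATION,
                extension: int = GAP_EXTENSION) -> int:
    """Affine penalty for one gap: a fixed opening cost plus a per-position cost.

    Assumes every gap position, including the first, pays the extension cost.
    """
    if length < 0:
        raise ValueError("gap length must be non-negative")
    if length == 0:
        return 0
    return creation + extension * length
```

Under this assumption a single-residue gap costs 53 and a ten-residue gap costs 80, so one long gap is penalized far less than ten separate single-residue gaps (530). This favours alignments in which a whole skipped or inserted exon appears as one contiguous gap.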

The following is a list which gives the name and the description of each
original tumor-involved sequence from which the alternative splice variant has
been varied by alternative splicing. The description is followed by the internal
reference to the novel variant (NV-... etc.) and a short comparison between the
variant and the original tumor-involved sequence. It should be noted that several
splice variants may have originated from the same parent sequence by
different alternative splicing events. The following table summarizes the accession
number of the original sequence, the terminology of the new variant (NV-1 to
NV-34) and the description of the difference between the new variant and the
original sequence.

Table

Accession      New variant #   Description of the new variant

KU70_HUMAN     NV-1            The new variant has an alternative 3' exon of 5 aa
                               instead of 240 amino acids. It is probably missing
                               the PHOSPHORYLATION (BY NUCLEAR KINASE NII) site
                               and half of the PRO-RICH domain but retains the
                               LEUCINE-ZIPPER domain.

KU70_HUMAN     NV-2            The new variant has a deletion of 210 aa between
                               residues 304-515. Lacks the PRO-RICH domain but
                               retains the LEUCINE-ZIPPER domain and
                               PHOSPHORYLATION (BY NUCLEAR KINASE NII) site.

LCK_HUMAN      NV-3            The new variant has an alternative 3' exon of 45
                               amino acids instead of 163 amino acids. The new
                               variant retains both SH domains and most of the
                               PROTEIN KINASE domain including two ATP BINDING
                               sites and the ACTIVE SITE. It is missing the 3' end
                               of the PROTEIN KINASE domain and lacks the
                               AUTO-PHOSPHORYLATION and PHOSPHORYLATION sites.

LCK_HUMAN      NV-4            Insertion of 58 amino acids after amino acid 62
                               (insertion does not result in truncation).
                               Insertion in first SH2 domain. The new variant
                               retains all important sites including: the PROTEIN
                               KINASE DOMAIN with two ATP BINDING sites, an ACTIVE
                               SITE and an AUTO PHOSPHORYLATION site. An
                               additional PHOSPHORYLATION site.

OSTP_HUMAN     NV-5            The new variant has an alternative 3' exon of 12 aa
                               instead of 134 aa. The new variant maintains the
                               CELL ATTACHMENT SITE and two GLYCOSYLATION sites.

GA45_HUMAN     NV-6            The new variant has an alternative 5' exon of 72
                               amino acids instead of 125 amino acids. The new
                               variant has a signal peptide and has the two
                               PHOSPHORYLATION (BY CK2) sites.

WN11_HUMAN     NV-7            The new variant has a deletion of 22 amino acids
                               after residue 312 (between 312-334). The new
                               variant has all five potential GLYCOSYLATION sites.

WN11_HUMAN     NV-8            The new variant has a deletion of 117 amino acids
                               after residue 116 (between 116-233). The new
                               variant is missing one potential GLYCOSYLATION site
                               (out of 5 sites).

KPCT_HUMAN     NV-9            The new variant has an alternative 3' exon of 3
                               amino acids instead of 94 amino acids. The
                               alternative region is in the protein kinase domain.
                               The new variant maintains the two PHORBOL-ESTER AND
                               DAG BINDING domains, the two ATP binding sites and
                               the active site of the kinase domain.

IRF1_HUMAN     NV-10           The new variant has an alternative 3' exon of 7
                               amino acids instead of 40 amino acids. The new
                               variant maintains the DNA binding domain.

FGR1_HUMAN     NV-11           The new variant has an alternative 3' exon of 14
                               amino acids instead of 134 amino acids. The new
                               variant has the entire extracellular domain and the
                               TM; it is missing part of the cytoplasmic domain.
                               The new variant maintains all 3
                               IMMUNOGLOBULIN-LIKE DOMAINS, the protein KINASE
                               domain, the ACTIVE site, and the 2 ATP binding
                               sites, but it might be missing one of the two
                               PHOSPHORYLATION (AUTO-) sites.

APE1_HUMAN     NV-12           The new variant has a gap of 22 amino acids between
                               residues 146-169. The new variant maintains the
                               active site and site important for substrate
                               recognition.

APE1_HUMAN     NV-13           The new variant has an insertion of 25 amino acids
                               after residue 18. The new variant maintains the
                               active site and site important for substrate
                               recognition.

MAD3_HUMAN     NV-14           The new variant has an alternative 3' exon of 3
                               amino acids instead of 15 amino acids. It retains
                               all five ANK motifs and the two PHOSPHORYLATION
                               sites.

MAD3_HUMAN     NV-15           The new variant has a deletion of 28 amino acids
                               between 183-212. The deletion is in the ANK MOTIF
                               4. The new variant maintains 4 out of the five ANK
                               MOTIFs and the two PHOSPHORYLATION sites.

EPA4_HUMAN     NV-16           Deletion of 65 aa after residue 832 (832-898).
                               Deletion in end of CYTOPLASMIC domain. The 3' end
                               of the PROTEIN KINASE domain is missing, but all
                               important sites are maintained. The new variant has
                               two FIBRONECTIN TYPE III domains and the protein
                               KINASE domain with 2 ATP binding sites, an ACTIVE
                               site and an auto PHOSPHORYLATION site.

ETS2_HUMAN     NV-17           The new variant has a deletion of 26 aa between
                               87-114. The new variant maintains the DNA binding
                               domain.

WN5A_HUMAN     NV-18           The new variant has an alternative 3' exon of 4
                               amino acids instead of 109. It is identical to the
                               known protein until residue 256. Two GLYCOSYLATION
                               sites out of four are missing in the new variant.

TY03_HUMAN     NV-19           The new variant has an alternative 3' exon of 45
                               amino acids instead of 216 amino acids. The new
                               variant is missing part of the PROTEIN KINASE
                               domain and its AUTOPHOSPHORYLATION site. However it
                               maintains all other necessary domains: the ACTIVE
                               site and the two ATP binding sites. The variant
                               retains all 6 GLYCOSYLATION sites, the 2 IG-like
                               domains and the 2 FIBRONECTIN TYPE III domains.

CAD2_HUMAN     NV-20           The new variant has an alternative 3' exon of 10
                               amino acids instead of 68 amino acids. The new
                               variant maintains the extracellular domain and the
                               TM domain. It is missing the end of the cytoplasmic
                               domain and the SER-RICH domain. However, it has all
                               other necessary domains including 5 CADHERIN
                               REPEATS with 7 GLYCOSYLATION sites.

MXI1_HUMAN     NV-21           NV_1 m85527_3. Insertion of 24 aa after residue 79.
                               Most likely truncated in insertion. Has basic DNA
                               binding domain, but lacks helix-loop-helix.

MXI1_HUMAN     NV-22           NV_2 m85527_5. Alternative 5' exon. Identical to
                               known from aa 26 to the end. Has a 5' exon of 31 aa
                               versus 25 aa of the known. Has both DNA binding
                               domain and helix-loop-helix. The alternative 5'
                               exon bears a clathrin repeat. Supported by 4 ESTs.

[illegible]    NV-23           Similar to known RNA at first [illegible] aa.
                               Alternative 3' exon of 10 aa instead of 28. The new
                               variant maintains the PROTEIN KINASE domain with
                               its two ATP binding sites, the ACTIVE site and two
                               PHOSPHORYLATION sites.

XRC1_HUMAN     NV-24           First 242 aa identical to known RNA. Alternative
                               short 3' exon of 50 aa instead of 391 aa.

XRC1_HUMAN     NV-25           Identical to known RNA in first 241 aa. Alternative
                               3' exon of 25 aa instead of 392.

XRC1_HUMAN     NV-26           Identical to known RNA in first 186 aa. Alternative
                               short 3' exon of length 61 aa, instead of 447 aa.

XRC1_HUMAN     NV-27           Identical to known RNA in first 540 aa. Alternative
                               3' exon of 84 aa instead of 93 aa.

MERL_HUMAN     NV-28           Deletion of 29 aa from position 333. The new
                               variant retains the Band 4.1-like domain. (Band
                               4.1 links the spectrin-actin cytoskeleton of
                               erythrocytes to the plasma membrane.)

DP1_HUMAN      NV-29           Alternative 3' exon of 21 amino acids instead of 72
                               amino acids. The new variant retains the two
                               transmembrane domains.

MDR1_HUMAN     NV-30           Alternative exon at 3' end at cytoplasmic domain.
                               1 aa instead of 3 of the known. Identical to known
                               until aa 1277.

MDR1_HUMAN     NV-31           The new variant is a truncated protein. It has an
                               alternative 3' exon of 12 amino acids instead of
                               713. It is identical to the known protein until
                               residue 567. The new variant retains only one out
                               of two ATP binding sites, and six out of twelve TM
                               domains. It has one out of three cytoplasmic
                               domains and is truncated in the middle of the
                               second cytoplasmic domain.

MK08_HUMAN     NV-32           Identical to known until aa 205. Truncated. Has
                               additional 13 aa. Lacks part of the protein kinase
                               domain. Retains the active site, the two ATP
                               binding sites and the two phosphorylation sites.

MK08_HUMAN     NV-33           Alternative 3' exon of 14 aa instead of 134 aa.
                               Identical to known until residue 293. Lacks end of
                               protein kinase domain. Retains the active site, the
                               two ATP binding sites and the two phosphorylation
                               sites.

MK08_HUMAN     NV-34           Alternative 3' exon of 7 aa instead of 95 aa.
                               Identical to known until residue 332. Has entire
                               protein kinase domain including the active site,
                               the two ATP binding sites and the two
                               phosphorylation sites.



The following is a list of the original tumor-involved sequences, followed
by all the splice variants obtained therefrom with a list of differences between the
original TIG sequence and the variant.

KU (p70/p80)
KU70_HUMAN

FUNCTION : SINGLE STRANDED DNA-DEPENDENT ATP-DEPENDENT
HELICASE. HAS A ROLE IN CHROMOSOME TRANSLOCATION. THE
DNA HELICASE II COMPLEX BINDS PREFERENTIALLY TO FORK-LIKE
ENDS OF DOUBLE-STRANDED DNA IN A CELL CYCLE-DEPENDENT
MANNER. IT WORKS IN THE 3'-5' DIRECTION. BINDING TO DNA MAY
BE MEDIATED BY P70.

SUBUNIT : HETERODIMER OF A 70 KD AND A 80 KD SUBUNIT.
SUBCELLULAR LOCATION : NUCLEAR.

PTM : PHOSPHORYLATED IN VIVO AT SERINE RESIDUES (BY
SIMILARITY).

DISEASE : INDIVIDUALS WITH SLE AND RELATED DISORDERS
PRODUCE EXTREMELY LARGE AMOUNTS OF AUTOANTIBODIES TO
P70 AND P86. EXISTENCE OF A MAJOR AUTOANTIGENIC EPITOPE OR
EPITOPES ON THE CARBOXY TERMINAL 190 AMINO ACIDS OF P70
CONTAINING THE LEUCINE REPEAT. THE MAJORITY OF
AUTOANTIBODIES TO P70 IN MOST SERA FROM PATIENTS WITH SLE
SEEM TO BE REACTIVE WITH THIS REGION.

SIMILARITY : BELONGS TO THE ATP-DEPENDENT DNA HELICASE II 70
KD SUBUNIT FAMILY.

NV_1

The new variant has an alternative 3' exon of 5 amino acids instead of 240
amino acids. It is probably missing the PHOSPHORYLATION (BY NUCLEAR
KINASE NII) site and half of the PRO-RICH domain but retains the
LEUCINE-ZIPPER domain.

KU (p70/p80)
KU70_HUMAN

NV_2

The new variant has a deletion of 210 amino acids between residues 304 -
515. The new variant lacks the PRO-RICH domain but retains the
LEUCINE-ZIPPER domain and PHOSPHORYLATION (BY NUCLEAR
KINASE NII) site.

LCK
LCK_HUMAN

PROTO-ONCOGENE TYROSINE-PROTEIN KINASE LCK

FUNCTION : MAY PARTICIPATE IN ANTIGEN-INDUCED T-CELL
ACTIVATION.

CATALYTIC ACTIVITY : ATP + A PROTEIN TYROSINE = ADP +
PROTEIN TYROSINE PHOSPHATE.

ENZYME REGULATION : REGULATED BY PHOSPHORYLATION ON
TYR-504.

SUBCELLULAR LOCATION : BOUND TO THE CYTOPLASMIC DOMAIN
OF EITHER CD4 OR CD8.

SIMILARITY : CONTAINS 1 SH2 DOMAIN.
SIMILARITY : CONTAINS 1 SH3 DOMAIN.

SIMILARITY : TO OTHER PROTEIN-TYROSINE KINASES IN THE
CATALYTIC DOMAIN. BELONGS TO THE SRC SUBFAMILY.

NV_3

The new variant has an alternative 3' exon of 45 amino acids instead of
163 amino acids. The new variant retains both SH domains and most of the
PROTEIN KINASE domain including two ATP BINDING sites and the
ACTIVE SITE. It is missing the 3' end of the PROTEIN KINASE domain and
lacks the AUTO-PHOSPHORYLATION and PHOSPHORYLATION sites.

LCK
LCK_HUMAN

NV_4

Insertion of 58 amino acids after amino acid 62 (insertion does not result
in truncation). Insertion in first SH2 domain. The new variant retains all
important sites including: the PROTEIN KINASE DOMAIN with two ATP
BINDING sites, an ACTIVE SITE and an AUTO PHOSPHORYLATION site.
An additional PHOSPHORYLATION site.

OSTEOPONTIN
OSTP_HUMAN

FUNCTION : BINDS TIGHTLY TO HYDROXYAPATITE. APPEARS TO
FORM AN INTEGRAL PART OF THE MINERALIZED MATRIX.
PROBABLY IMPORTANT TO CELL-MATRIX INTERACTION.
ALTERNATIVE PRODUCTS : TWO ISOFORMS, OP1A AND OP1B
(SHOWN HERE), ARE PRODUCED BY ALTERNATIVE SPLICING.

PTM : EXTENSIVELY PHOSPHORYLATED ON SERINE RESIDUES.
PTM : N- AND O-GLYCOSYLATED.

DISEASE : THIS PROTEIN PLAYS A PRINCIPAL ROLE IN URINARY
STONE FORMATION AS THE STONE MATRIX.

NV_5

The new variant has an alternative 3' exon of 12 amino acids instead of
134 amino acids. The new variant maintains the CELL ATTACHMENT SITE
and two GLYCOSYLATION sites.

GADD45
GA45_HUMAN

GROWTH ARREST AND DNA-DAMAGE-INDUCIBLE PROTEIN
FUNCTION : INVOLVED IN THE REGULATION OF GROWTH AND
APOPTOSIS. MEDIATES ACTIVATION OF STRESS-RESPONSIVE
MTK1/MEKK4 MAPKKK.
SIMILARITY : BELONGS TO THE GADD45 / MYD118 FAMILY.

NV_6

The new variant has an alternative 5' exon of 72 amino acids instead of
125 amino acids. The new variant has a signal peptide and has the two
PHOSPHORYLATION (BY CK2) sites.

WNT-11 PROTEIN
WN11_HUMAN

FUNCTION : PROBABLE DEVELOPMENTAL PROTEIN. MAY BE A
SIGNALING MOLECULE WHICH AFFECTS THE DEVELOPMENT OF
DISCRETE REGIONS OF TISSUES. IS LIKELY TO SIGNAL OVER ONLY
FEW CELL DIAMETERS.

SUBCELLULAR LOCATION : POSSIBLY SECRETED AND ASSOCIATES
WITH THE EXTRACELLULAR MATRIX.
SIMILARITY : BELONGS TO THE WNT FAMILY.

NV_7

The new variant has a deletion of 22 amino acids after residue 312
(between 312-334). The new variant has all five potential GLYCOSYLATION
sites.

WNT-11 PROTEIN
WN11_HUMAN

NV_8

The new variant has a deletion of 117 amino acids after residue 116
(between 116-233). The new variant is missing one potential GLYCOSYLATION
site (out of 5 sites).

PROTEIN KINASE C, THETA TYPE
KPCT_HUMAN

FUNCTION : THIS IS CALCIUM-INDEPENDENT, PHOSPHOLIPID-
DEPENDENT, SERINE- AND THREONINE-SPECIFIC ENZYME.
FUNCTION : PKC IS ACTIVATED BY DIACYLGLYCEROL WHICH IN
TURN PHOSPHORYLATES A RANGE OF CELLULAR PROTEINS. PKC
ALSO SERVES AS THE RECEPTOR FOR PHORBOL ESTERS, A CLASS OF
TUMOR PROMOTERS.

TISSUE SPECIFICITY : SKELETAL MUSCLE, MEGAKARYOBLASTIC
CELLS AND PLATELETS.
SIMILARITY : CONTAINS 2 ZINC-DEPENDENT PHORBOL-ESTER AND
DAG BINDING DOMAINS.

SIMILARITY : BELONGS TO THE SER/THR FAMILY OF PROTEIN
KINASES. PKC SUBFAMILY.

NV_9

The new variant has an alternative 3' exon of 3 amino acids instead of 94
amino acids. The alternative region is in the PROTEIN KINASE domain. The
new variant maintains the two PHORBOL-ESTER AND DAG BINDING
domains, the two ATP binding sites and the ACTIVE site of the KINASE domain.

INTERFERON REGULATORY FACTOR 1
IRF1_HUMAN

FUNCTION : SPECIFICALLY BINDS TO THE UPSTREAM REGULATORY
REGION OF TYPE I IFN AND IFN-INDUCIBLE MHC CLASS I GENES
(THE INTERFERON CONSENSUS SEQUENCE (ICS)) AND ACTIVATES
THOSE GENES.

SUBCELLULAR LOCATION : NUCLEAR.
INDUCTION : BY VIRUSES AND IFN.

DISEASE : DELETION OR REARRANGEMENT OF IRF1 ARE A CAUSE OF
PRELEUKEMIC MYELODYSPLASTIC SYNDROME (MDS) AND OF
ACUTE MYELOGENOUS LEUKEMIA (AML).
SIMILARITY : BELONGS TO THE IRF FAMILY.

NV_10

The new variant has an alternative 3' exon of 7 amino acids instead of 40
amino acids. The new variant maintains the DNA binding domain.

BASIC FIBROBLAST GROWTH FACTOR RECEPTOR 1
FGR1_HUMAN

FUNCTION : RECEPTOR FOR BASIC FIBROBLAST GROWTH FACTOR. A
SHORTER FORM OF THE RECEPTOR COULD BE A RECEPTOR FOR
ACIDIC FGF (AFGF).

CATALYTIC ACTIVITY : ATP + PROTEIN TYROSINE = ADP + PROTEIN
TYROSINE PHOSPHATE.

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN.

ALTERNATIVE PRODUCTS : MANY FORMS OF FGFR1 ARE PRODUCED
BY ALTERNATIVE SPLICING. THE FORM SHOWN HERE IS KNOWN AS
ALPHA-A1.

DISEASE : DEFECTS IN FGFR1 ARE ONE OF THE CAUSES OF PFEIFFER
SYNDROME, ALSO CALLED ACROCEPHALOSYNDACTYLY TYPE V
(ACS V), CHARACTERIZED BY CRANIOSYNOSTOSIS (PREMATURE
FUSION OF THE SKULL SUTURES) WITH DEVIATION AND
ENLARGEMENT OF THE THUMBS AND GREAT
TOES, BRACHYMESOPHALANGY, WITH PHALANGEAL ANKYLOSIS
AND A VARYING DEGREE OF SOFT TISSUE SYNDACTYLY.

SIMILARITY : BELONGS TO THE FIBROBLAST GROWTH FACTOR
RECEPTOR FAMILY.

SIMILARITY : CONTAINS 3 IMMUNOGLOBULIN-LIKE DOMAINS.

NV_11

The new variant has an alternative 3' exon of 14 amino acids instead of
134 amino acids. The new variant has the entire extracellular domain and the TM;
it is missing part of the cytoplasmic domain. The new variant maintains all 3
IMMUNOGLOBULIN-LIKE DOMAINS, the protein KINASE domain, the
ACTIVE site, and the 2 ATP binding sites, but it might be missing one of the two
PHOSPHORYLATION (AUTO-) sites.

REF-1 PROTEIN DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE
APE1_HUMAN

FUNCTION : REPAIRS OXIDATIVE DNA DAMAGES IN VITRO. MAY
HAVE A ROLE IN PROTECTION AGAINST CELL LETHALITY AND
SUPPRESSION OF MUTATIONS. REMOVES THE BLOCKING GROUPS
FROM THE 3' TERMINI OF THE DNA STRAND BREAKS GENERATED BY
IONIZING RADIATIONS AND BLEOMYCIN.

CATALYTIC ACTIVITY : ENDONUCLEOLYTIC CLEAVAGE NEAR
APURINIC OR APYRIMIDINIC SITES TO PRODUCTS WITH
5'-PHOSPHATE.

SUBCELLULAR LOCATION : NUCLEAR.

SIMILARITY : BELONGS TO THE AP/EXOA FAMILY OF DNA REPAIR
ENZYMES.

NV_12

The new variant has a gap of 22 amino acids between residues 146-169.
The new variant maintains the ACTIVE site and the site important for substrate
recognition.

NV_13

The new variant has an insertion of 25 amino acids after residue 18. It
maintains the ACTIVE site and the site important for substrate recognition.

MAD3 MAJOR HISTOCOMPATIBILITY COMPLEX 
ENHANCER-BINDING PROTEIN 

MAD3_HUMAN

FUNCTION : I-KAPPA-B-LIKE ACTIVITY. MAY BE INVOLVED IN 
REGULATION OF TRANSCRIPTIONAL RESPONSES TO NF-KAPPA-B, 
INCLUDING ADHESION-DEPENDENT PATHWAYS OF MONOCYTE
ACTIVATION. INTERACTS DIRECTLY WITH THE NF-KAPPA-B 
COMPLEX, PRESUMABLY THROUGH THE P65 SUBUNIT. 
INDUCTION : INDUCED IN ADHERENT MONOCYTES. 
PTM : PHOSPHORYLATION OF I-KAPPA-B BLOCKS ITS ABILITY TO 
INHIBIT NF-KAPPA-B DNA-BINDING ACTIVITY. 
SIMILARITY : CONTAINS 5 ANK REPEATS.

NV_14

The new variant has an alternative 3' exon of 3 amino acids instead of 15
amino acids. It retains all five ANK motifs and the two PHOSPHORYLATION
sites.

NV_15 

The new variant has a deletion of 28 amino acids between 183 - 212. The
deletion is in the ANK MOTIF 4. The new variant maintains 4 out of the five
ANK MOTIFs and the two PHOSPHORYLATION sites.

RECEPTOR PROTEIN-TYROSINE KINASE HEK8 

EPA4_HUMAN

FUNCTION : RECEPTOR FOR MEMBERS OF THE EPHRIN-A FAMILY.
BINDS TO EPHRIN-A1, -A4 AND -A5. BINDS MORE POORLY TO
EPHRIN-A2 AND -A3.
CATALYTIC ACTIVITY : ATP + A PROTEIN TYROSINE = ADP +
PROTEIN TYROSINE PHOSPHATE.

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN. 
SIMILARITY : CONTAINS 2 FIBRONECTIN TYPE III-LIKE DOMAINS. 
SIMILARITY : TO OTHER PROTEIN-TYROSINE KINASES IN THE
CATALYTIC DOMAIN. BELONGS TO THE EPHRIN RECEPTOR FAMILY.

NV_16

The new variant has a deletion of 65 amino acids between 832 - 898. The
deletion is in the cytoplasmic domain. The 3' end of the PROTEIN KINASE domain
is missing, but all important sites are maintained. The new variant has two
FIBRONECTIN TYPE III domains and the protein KINASE domain with 2 ATP binding
sites, an ACTIVE site and an auto PHOSPHORYLATION site.



C-ETS-2 PROTEIN 
ETS2_HUMAN



SUBCELLULAR LOCATION : NUCLEAR. 
SIMILARITY : BELONGS TO THE ETS FAMILY. 

NV_17

The new variant has a deletion of 26 amino acids between 87 - 114. The 
new variant maintains the DNA binding domain. 

WNT-5A PROTEIN 
WN5A_HUMAN 

FUNCTION : PROBABLE DEVELOPMENTAL PROTEIN. MAY BE A 
SIGNALING MOLECULE WHICH AFFECTS THE DEVELOPMENT OF 
DISCRETE REGIONS OF TISSUES. IS LIKELY TO SIGNAL OVER 
ONLY FEW CELL DIAMETERS. 

SUBCELLULAR LOCATION : POSSIBLY SECRETED AND ASSOCIATES 
WITH THE EXTRACELLULAR MATRIX. 
SIMILARITY : BELONGS TO THE WNT FAMILY 

NV_18 

The new variant has an alternative 3' exon of 4 amino acids instead of 
109. It is identical to the known protein until residue 256. Two 
GLYCOSYLATION sites out of four are missing in the new variant.

TYROSINE-PROTEIN KINASE SKY
TYO3_HUMAN

FUNCTION : MAY BE INVOLVED IN CELL ADHESION PROCESSES,
PARTICULARLY IN THE CENTRAL NERVOUS SYSTEM.
SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN.
TISSUE SPECIFICITY : ABUNDANT IN THE BRAIN AND LOWER LEVELS
IN OTHER TISSUES.

SIMILARITY : TO OTHER PROTEIN-TYROSINE KINASES IN THE
CATALYTIC DOMAIN.

SIMILARITY : CONTAINS 2 IMMUNOGLOBULIN-LIKE C2-TYPE
DOMAINS.

SIMILARITY : CONTAINS 2 FIBRONECTIN TYPE III-LIKE DOMAINS.

NV_19

The new variant has an alternative 3' exon of 45 amino acids instead of
216 amino acids. The new variant is missing part of the PROTEIN KINASE
domain and its AUTOPHOSPHORYLATION site. However, it maintains all
other necessary domains: the ACTIVE site and the two ATP binding sites. The
variant retains all 6 GLYCOSYLATION sites, the 2 IG-like domains and the 2
FIBRONECTIN TYPE III domains.

NEURAL-CADHERIN
CAD2_HUMAN

FUNCTION : CADHERINS ARE CALCIUM DEPENDENT CELL ADHESION
PROTEINS. THEY PREFERENTIALLY INTERACT WITH THEMSELVES
IN A HOMOPHILIC MANNER IN CONNECTING CELLS; CADHERINS
MAY THUS CONTRIBUTE TO THE SORTING OF HETEROGENEOUS
CELL TYPES. N-CADHERIN MAY BE INVOLVED IN NEURONAL
RECOGNITION MECHANISM.

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN.
SIMILARITY : BELONGS TO THE CADHERIN FAMILY.

NV_20

The new variant has an alternative 3' exon of 10 amino acids instead of 68
amino acids. The new variant maintains the extracellular domain and the TM
domain. It is missing the end of the cytoplasmic domain and the SER-RICH
domain. However, it has all other necessary domains including: 5 CADHERIN
REPEATS with 7 GLYCOSYLATION sites.

MXI1 MAX INTERACTING PROTEIN 1
MXI1_HUMAN

FUNCTION : TRANSCRIPTIONAL REPRESSOR. MXI1 BINDS WITH MAX
TO FORM A SEQUENCE-SPECIFIC DNA-BINDING PROTEIN COMPLEX
WHICH RECOGNIZES THE CORE SEQUENCE 5'-CAC[GA]TG-3'. MXI1
THUS ANTAGONIZES MYC TRANSCRIPTIONAL ACTIVITY BY
COMPETING FOR MAX.

SUBUNIT : EFFICIENT DNA BINDING REQUIRES DIMERIZATION WITH
ANOTHER BHLH PROTEIN. BINDS DNA AS A HETERODIMER WITH
MAX.

SUBCELLULAR LOCATION : NUCLEAR.
TISSUE SPECIFICITY : HIGH LEVELS FOUND IN THE BRAIN, HEART
AND LUNG WHILE LOWER LEVELS ARE SEEN IN THE LIVER, KIDNEY
AND SKELETAL MUSCLE.

DISEASE : DEFECTS IN MXI1 ARE FOUND IN SOME PATIENTS WITH
PROSTATE TUMORS.
SIMILARITY : BELONGS TO THE BASIC HELIX-LOOP-HELIX (BHLH)
FAMILY OF TRANSCRIPTION FACTORS.

NV_21 

The new variant has an insertion of 24 amino acids after residue 79. It is
most likely truncated within the insertion. The new variant retains the BASIC
DNA BINDING domain, but lacks the HELIX LOOP HELIX motif.

MXI1 MAX INTERACTING PROTEIN 1
MXI1_HUMAN 

NV_22 

The new variant has an alternative 5' exon of 31 amino acids instead of
25. It is identical to the known protein from residue 26 to the end. The new
variant has both the DNA BINDING DOMAIN and the HELIX LOOP HELIX
motif. The alternative 5' exon bears a clathrin repeat.

DUAL SPECIFICITY MITOGEN-ACTIVATED PROTEIN KINASE KINASE 3
MPK3_HUMAN

FUNCTION : DUAL SPECIFICITY KINASE. IS ACTIVATED BY
CYTOKINES AND ENVIRONMENTAL STRESS IN VIVO. CATALYZES
THE CONCOMITANT PHOSPHORYLATION OF A THREONINE AND A
TYROSINE RESIDUE IN THE MAP KINASE P38.

ENZYME REGULATION : ACTIVATED BY DUAL PHOSPHORYLATION
ON SER-189 AND THR-193.
TISSUE SPECIFICITY : ABUNDANT EXPRESSION IS SEEN IN THE
SKELETAL MUSCLE. IT IS ALSO WIDELY EXPRESSED IN OTHER
TISSUES.

PTM : AUTOPHOSPHORYLATED.

SIMILARITY : BELONGS TO THE SER/THR FAMILY OF PROTEIN
KINASES. MAP KINASE KINASE SUBFAMILY.

NV_23 

The new variant has an alternative 3' exon of 10 amino acids instead of 28
amino acids. The new variant maintains the PROTEIN KINASE domain with its
two ATP binding sites, the ACTIVE site and two PHOSPHORYLATION sites. It
may lack a few amino acids at the end of the PROTEIN KINASE domain.

DNA-REPAIR PROTEIN XRCC1
XRC1_HUMAN



FUNCTION : CORRECTS DEFECTIVE DNA STRAND-BREAK REPAIR
AND SISTER CHROMATID EXCHANGE FOLLOWING TREATMENT
WITH IONIZING RADIATION AND ALKYLATING AGENTS.
SUBCELLULAR LOCATION : NUCLEAR (PROBABLE).
SIMILARITY : SOME, TO S.POMBE RAD4/CUT5.

NV_24

Alternative 3' exon of 50 amino acids instead of 391 amino acids. 



DNA-REPAIR PROTEIN XRCC1
XRC1_HUMAN

NV_25

Alternative 3' exon of 25 amino acids instead of 392 amino acids.

DNA-REPAIR PROTEIN XRCC1
XRC1_HUMAN

NV_26

Alternative 3' exon of 61 amino acids instead of 447 amino acids.

DNA-REPAIR PROTEIN XRCC1
XRC1_HUMAN

NV_27

Alternative 3' exon of 84 amino acids instead of 93 amino acids.

MERLIN SCHWANNOMIN (NF2)
MERL_HUMAN

FUNCTION : PROBABLY ACTS AS A MEMBRANE STABILIZING
PROTEIN.

TISSUE SPECIFICITY : IN FETAL BRAIN; IN KIDNEY, LUNG, BREAST,
AND OVARY.

DISEASE : NEUROFIBROMATOSIS 2 (NF2) OR CENTRAL
NEUROFIBROMATOSIS IS A GENETIC DISORDER CHARACTERIZED
BY BILATERAL VESTIBULAR SCHWANNOMAS (FORMERLY CALLED
ACOUSTIC NEUROMAS), SCHWANNOMAS OF OTHER CRANIAL AND
PERIPHERAL NERVES, MENINGIOMAS, AND EPENDYMOMAS. IT IS
INHERITED IN AN AUTOSOMAL DOMINANT FASHION WITH FULL
PENETRANCE. AFFECTED INDIVIDUALS GENERALLY DEVELOP
SYMPTOMS OF EIGHTH-NERVE DYSFUNCTION IN EARLY ADULTHOOD,
INCLUDING DEAFNESS AND BALANCE DISORDER. ALTHOUGH THE
TUMORS OF NF2 ARE HISTOLOGICALLY BENIGN, THEIR ANATOMIC
LOCATION MAKES MANAGEMENT DIFFICULT, AND PATIENTS
SUFFER GREAT MORBIDITY AND MORTALITY.

SIMILARITY : CONTAINS A DOMAIN FOUND IN BAND 4.1, EZRIN,
MOESIN, RADIXIN, AND TALIN.

NV_28 

The new variant has a deletion of 29 amino acids after residue 333. The
new variant maintains the BAND 4.1-LIKE domain (Band 4.1 links the
spectrin-actin cytoskeleton of erythrocytes to the plasma membrane).

DP1 POLYPOSIS LOCUS PROTEIN 1
DP1_HUMAN

SUBCELLULAR LOCATION : INTEGRAL MEMBRANE PROTEIN
(POTENTIAL).

SIMILARITY : TO C.ELEGANS T19C3.4.

NV_29

Alternative 3' exon of 21 amino acids instead of 72 amino acids. The new 
variant maintains the two transmembrane domains. 

MDR1 MULTIDRUG RESISTANCE PROTEIN 1 
MDR1_HUMAN

FUNCTION : ENERGY-DEPENDENT EFFLUX PUMP RESPONSIBLE FOR
DECREASED DRUG ACCUMULATION IN MULTIDRUG-RESISTANT
CELLS.

SUBCELLULAR LOCATION : INTEGRAL MEMBRANE PROTEIN.
SIMILARITY : BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN
FAMILY (ABC TRANSPORTERS). MDR SUBFAMILY.

NV_30

The new variant has an alternative 3' exon of 1 amino acid, instead of 3
of the known protein. The new variant is identical to the known protein until
residue 1277. It maintains all important sites.

MDR1 MULTIDRUG RESISTANCE PROTEIN 1
MDR1_HUMAN

NV_31 

The new variant is a truncated protein. It has an alternative 3' exon of 12
amino acids instead of 713. It is identical to the known protein until residue 567.
The new variant retains only one out of two ATP binding sites, and six out of
twelve TM domains. It has one out of three cytoplasmic domains and is truncated
in the middle of the second cytoplasmic domain.






JNK1 - MITOGEN ACTIVATED PROTEIN KINASE 8 

MK08_HUMAN 

FUNCTION : PROBABLY PLAYS A ROLE IN THE SIGNAL
TRANSDUCTION PATHWAY INITIATED BY PROINFLAMMATORY
CYTOKINES AND UV RADIATION. BINDS TO THE N-TERMINAL
ACTIVATION DOMAINS OF C-JUN AND ATF2 AND PHOSPHORYLATES
THEIR REGULATORY SITES (RESPECTIVELY SER-63 AND SER-73;
THR-69 AND THR-71). JNK1 ISOFORMS DISPLAY DIFFERENT BINDING
PATTERNS: BETA-1 PREFERENTIALLY BINDS TO C-JUN, WHEREAS
ALPHA-1, ALPHA-2, AND BETA-2 HAVE A SIMILAR LOW LEVEL OF
BINDING TO BOTH C-JUN OR ATF2. HOWEVER THERE IS NO
CORRELATION BETWEEN BINDING AND PHOSPHORYLATION, WHICH
IS ACHIEVED ABOUT AT THE SAME EFFICIENCY BY ALL ISOFORMS.
ENZYME REGULATION : ACTIVATED BY THREONINE AND TYROSINE
PHOSPHORYLATION.

ALTERNATIVE PRODUCTS : FOUR ISOFORMS JNK1 ALPHA-1, JNK1
ALPHA-2 (SHOWN HERE), JNK1 BETA-1, AND JNK1 BETA-2
ARE PRODUCED BY ALTERNATIVE SPLICING.

INDUCTION : BY UV LIGHT, INTERLEUKIN-1 AND BY HA-RAS.
SIMILARITY : BELONGS TO THE CDC2/CDC28 SUBFAMILY OF SER/THR
PROTEIN KINASES. STRONGEST SIMILARITY WITH OTHER MAP
KINASES.

NV_32 

The new variant is a truncated protein. It has an alternative 3' exon of 13
amino acids instead of 222 amino acids. It is identical to the known protein until
residue 205. The new variant lacks part of the PROTEIN KINASE domain;
however, it retains the ACTIVE SITE, the two ATP binding sites and the two
PHOSPHORYLATION sites.

JNK1 - MITOGEN ACTIVATED PROTEIN KINASE 8

MK08_HUMAN

NV_33

The new variant has an alternative 3' exon of 14 amino acids instead of
134 amino acids of the known protein. It is identical to the known protein until
residue 293. The new variant lacks the end of the PROTEIN KINASE domain,
but retains the ACTIVE SITE, the two ATP binding sites and the two
PHOSPHORYLATION sites.

JNK1 - MITOGEN ACTIVATED PROTEIN KINASE 8 

MK08_HUMAN 

NV_34

The new variant has an alternative 3' exon of 7 amino acids instead of
95 amino acids. It is identical to the known protein until residue 332. It has the
entire PROTEIN KINASE domain including the ACTIVE SITE, the two ATP
binding sites and the two PHOSPHORYLATION sites.
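
The variant descriptions above follow two recurring patterns: an alternative 3' exon (the variant is identical to the known protein up to a given residue and then carries a new tail), and an internal deletion "between" two residues that are themselves retained. The following is a minimal Python sketch of that residue arithmetic only. The function names and the placeholder sequences are assumptions made for illustration; they are not part of the disclosure and do not reproduce any SEQ ID NO.

```python
def alternative_3prime_variant(known: str, identical_until: int, new_tail: str) -> str:
    """Keep residues 1..identical_until of the known protein, then append the new tail."""
    if not 0 <= identical_until <= len(known):
        raise ValueError("identical_until must lie within the known protein")
    return known[:identical_until] + new_tail


def internal_deletion_variant(known: str, last_kept: int, first_resumed: int) -> str:
    """Remove residues last_kept+1 .. first_resumed-1 (1-based); both bounds are kept."""
    if not 0 <= last_kept < first_resumed <= len(known) + 1:
        raise ValueError("deletion bounds are out of range")
    return known[:last_kept] + known[first_resumed - 1:]


# Placeholder sequence of 427 residues (hypothetical, for length bookkeeping only).
known = ("ACDEFGHIKLMNPQRSTVWY" * 22)[:427]

# Pattern of NV_34: identical until residue 332, 7 new residues instead of 95.
variant = alternative_3prime_variant(known, 332, "KLMNPQR")
replaced = len(known) - 332   # number of known residues that were replaced

# Pattern of NV_12: 22 residues missing between residues 146 and 169.
shortened = internal_deletion_variant(known[:200], 146, 169)
```

With these placeholder inputs, `replaced` is 95 and `variant` is 339 residues long, which matches the bookkeeping stated for NV_34 (332 + 95 = 427 in the known protein).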


Example II: Variant nucleic acid sequence 

The nucleic acid sequences of the invention include nucleic acid
sequences which encode variant product and fragments and analogs thereof. The
nucleic acid sequences may alternatively be sequences complementary to the
above coding sequence, or to a region of said coding sequence. The length of the
complementary sequence is sufficient to avoid the expression of the coding
sequence. The nucleic acid sequences may be in the form of RNA or in the form
of DNA, and include messenger RNA, synthetic RNA and DNA, cDNA, and
genomic DNA. The DNA may be double-stranded or single-stranded, and if
single-stranded may be the coding strand or the non-coding (anti-sense,
complementary) strand. The nucleic acid sequences may also include both
dNTPs and rNTPs, as well as non-naturally occurring sequences. The sequence may
also be a part of a hybrid between an amino acid sequence and a nucleic acid
sequence.

In a general embodiment, the nucleic acid sequence has at least 90%
identity with any one of the sequences identified as SEQ ID NO: 1 to SEQ ID
NO: 34, provided that this sequence is not completely identical with that of the
original sequence.
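
The identity criterion stated above can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned (gaps written as "-") and computes identity as matching columns over aligned columns; the text does not specify an alignment method or an identity formula, so both choices, as well as the function names, are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns that carry the same base; gap columns count as mismatches."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences have a gap; they carry no information.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_criterion(candidate: str, reference: str, threshold: float = 90.0) -> bool:
    """At least `threshold` percent identical, but not completely identical."""
    identity = percent_identity(candidate, reference)
    return threshold <= identity < 100.0


# Toy aligned sequences (hypothetical): one mismatch in ten positions.
ok = meets_identity_criterion("ACGTACGTAA", "ACGTACGTAC")
```

The upper bound reflects the proviso that the sequence is not completely identical with the original sequence.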

The nucleic acid sequences may include the coding sequence by itself. By
another alternative the coding region may be in combination with additional
coding sequences, such as those coding for fusion protein or signal peptides, in
combination with non-coding sequences, such as introns and control elements,
promoter and terminator elements or 5' and/or 3' untranslated regions, effective
for expression of the coding sequence in a suitable host, and/or in a vector or host
environment in which the variant nucleic acid sequence is introduced as a
heterologous sequence.

The nucleic acid sequences of the present invention may also have the
product coding sequence fused in-frame to a marker sequence which allows for
purification of the variant product. The marker sequence may be, for example, a
hexahistidine tag to provide for purification of the mature polypeptide fused to
the marker in the case of a bacterial host, or the marker sequence may be a
hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. The
HA tag corresponds to an epitope derived from the influenza hemagglutinin
protein (Wilson, I., et al., Cell 37:767 (1984)).

Also included in the scope of the invention are fragments as defined above,
also referred to herein as oligonucleotides, typically having at least 20 bases,
preferably 20-30 bases, corresponding to a region of the coding-sequence nucleic
acid sequence. The fragments may be used as probes, primers, and when
complementary also as antisense agents, and the like, according to known
methods.
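
As an illustration of a complementary (antisense) oligonucleotide of the stated length, the following Python sketch takes a 20-30 base window of a coding sequence and returns its reverse complement. The function names and the toy coding sequence are assumptions made for illustration and are not taken from the sequence listing.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def antisense_oligo(coding: str, start: int, length: int = 20) -> str:
    """Antisense oligonucleotide against coding[start:start+length] (0-based start)."""
    if not 20 <= length <= 30:
        raise ValueError("length is expected to be 20-30 bases")
    region = coding[start:start + length]
    if len(region) != length:
        raise ValueError("window runs past the end of the coding sequence")
    return reverse_complement(region)


# Toy coding sequence (hypothetical).
coding = "ATGGCCAAGCTTGGATCCGAATTCAAA"
oligo = antisense_oligo(coding, 0, 20)
```

Taking the reverse complement of `oligo` recovers the targeted window of the coding strand, which is a convenient sanity check when designing such fragments.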
As indicated above, the nucleic acid sequence may be substantially as
depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 34, or fragments thereof, or
sequences having at least 90% identity to the above sequence as explained above.
Alternatively, due to the degenerate nature of the genetic code, the sequence
may be a sequence coding for any one of the amino acid sequences of SEQ ID
NO: 35 to SEQ ID NO: 68, or fragments or analogs of said amino acid sequence.
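
The degeneracy of the genetic code means that many different nucleic acid sequences encode the same amino acid sequence. The Python sketch below builds the standard genetic code table and counts how many distinct coding sequences exist for a short peptide. The helper names and the toy peptides are assumptions made for illustration.

```python
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Invert the table: amino acid -> list of synonymous codons.
SYNONYMS: dict[str, list[str]] = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(SYNONYMS[aa]) for aa in peptide.upper())


n_ml = count_encodings("ML")   # Met has 1 codon, Leu has 6
```

For example, ATGTTA and ATGCTC differ at three positions yet both translate to Met-Leu.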

A. Preparation of nucleic acid sequences 

The nucleic acid sequences may be obtained by screening cDNA libraries
using oligonucleotide probes which can hybridize to or PCR-amplify nucleic acid
sequences which encode the variant products disclosed above. cDNA libraries
prepared from a variety of tissues are commercially available and procedures for
screening and isolating cDNA clones are well-known to those of skill in the art.
Such techniques are described in, for example, Sambrook et al. (1989) Molecular
Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor Press,
Plainview, N.Y. and Ausubel FM et al. (1989) Current Protocols in Molecular
Biology, John Wiley & Sons, New York, N.Y.

The nucleic acid sequences may be extended to obtain upstream and
downstream sequences such as promoters, regulatory elements, and 5' and 3'
untranslated regions (UTRs). Extension of the available transcript sequence may
be performed by numerous methods known to those of skill in the art, such as
PCR or primer extension (Sambrook et al., supra), or by the RACE method
using, for example, the Marathon RACE kit (Clontech, Cat. # K1802-1).

Alternatively, the technique of "restriction-site" PCR (Gobinda et al. PCR
Methods Applic. 2:318-22, (1993)), which uses universal primers to retrieve
flanking sequence adjacent to a known locus, may be employed. First, genomic
DNA is amplified in the presence of a primer to a linker sequence and a primer
specific to the known region. The amplified sequences are subjected to a second
round of PCR with the same linker primer and another specific primer internal to
the first one. Products of each round of PCR are transcribed with an appropriate
RNA polymerase and sequenced using reverse transcriptase.

Inverse PCR can be used to amplify or extend sequences using divergent
primers based on a known region (Triglia, T. et al., Nucleic Acids Res. 16:8186,
(1988)). The primers may be designed using OLIGO(R) 4.06 Primer Analysis
Software (1992; National Biosciences Inc, Plymouth, Minn.), or another
appropriate program, to be 22-30 nucleotides in length, to have a GC content of
50% or more, and to anneal to the target sequence at temperatures about 68-72°C.
The method uses several restriction enzymes to generate a suitable fragment in
the known region of a gene. The fragment is then circularized by intramolecular
ligation and used as a PCR template.
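
The length and GC-content criteria given above for primer design can be checked with a few lines of Python. This sketch is not the OLIGO software and does not model the annealing temperature, which depends on the melting-temperature method chosen; the function names and the toy primers are assumptions made for illustration.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    if not primer:
        raise ValueError("primer is empty")
    return sum(primer.count(base) for base in "GC") / len(primer)


def passes_basic_primer_checks(
    primer: str, min_len: int = 22, max_len: int = 30, min_gc: float = 0.5
) -> bool:
    """True if the primer is 22-30 nt of A/C/G/T with a GC content of 50% or more."""
    primer = primer.upper()
    if set(primer) - set("ACGT"):
        return False
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Toy primers (hypothetical).
good = passes_basic_primer_checks("GCGCGCATATGCGCGCATATGCGC")   # 24 nt, 16 of 24 are G or C
at_rich = passes_basic_primer_checks("AT" * 12)                 # 24 nt, no G or C
too_short = passes_basic_primer_checks("GC" * 10)               # 20 nt
```

A candidate that passes these checks would still need its annealing behaviour evaluated by whichever primer analysis program is used.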

Capture PCR (Lagerstrom, M. et al., PCR Methods Applic. 1:111-19,
(1991)) is a method for PCR amplification of DNA fragments adjacent to a
known sequence in human and yeast artificial chromosome DNA. Capture PCR
also requires multiple restriction enzyme digestions and ligations to place an
engineered double-stranded sequence into a flanking part of the DNA molecule
before PCR.

Another method which may be used to retrieve flanking sequences is that
of Parker, J.D., et al. (Nucleic Acids Res., 19:3055-60, (1991)). Additionally, one
can use PCR, nested primers and PromoterFinder™ libraries to "walk in" genomic
DNA (PromoterFinder™; Clontech, Palo Alto, CA). This process avoids the need
to screen libraries and is useful in finding intron/exon junctions. Preferred
libraries for screening for full length cDNAs are ones that have been size-selected
to include larger cDNAs. Also, random primed libraries are preferred in that they
will contain more sequences which contain the 5' and upstream regions of genes.

A randomly primed library may be particularly useful if an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries are useful for
extension into the 5' nontranslated regulatory region.

The nucleic acid sequences and oligonucleotides of the invention can also
be prepared by solid-phase methods, according to known synthetic methods.
Typically, fragments of up to about 100 bases are individually synthesized, then
joined to form continuous sequences up to several hundred bases.

B. Use of variant nucleic acid sequence for the production of
variant products

In accordance with the present invention, nucleic acid sequences specified
above may be used as recombinant DNA molecules that direct the expression of
variant products.

As will be understood by those of skill in the art, it may be advantageous
to produce variant product-encoding nucleotide sequences possessing codons
other than those which appear in any one of SEQ ID NO: 1 to SEQ ID NO: 34,
which are those which naturally occur in the human genome. Codons preferred
by a particular prokaryotic or eukaryotic host (Murray, E. et al., Nuc Acids Res.,
17:477-508, (1989)) can be selected, for example, to increase the rate of variant
product expression or to produce recombinant RNA transcripts having desirable
properties, such as a longer half-life, than transcripts produced from naturally
occurring sequence.
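
Selecting host-preferred codons amounts to replacing each codon with a synonymous codon while leaving the encoded amino acid sequence unchanged. The Python sketch below shows this substitution for a toy coding sequence. The preference table is a hypothetical placeholder and is not a measured codon-usage table for any host; the function names are likewise assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds: str, preferred: dict[str, str]) -> str:
    """Swap codons for the preferred synonymous codon; others are left unchanged."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    for aa, codon in preferred.items():
        if CODON_TABLE[codon] != aa:
            raise ValueError(f"{codon} does not encode {aa}")
    recoded = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        recoded.append(preferred.get(CODON_TABLE[codon], codon))
    return "".join(recoded)


# Hypothetical preference table (placeholder values, not measured usage data).
preferred = {"L": "CTG", "R": "CGT"}
original = "ATGTTAAGATAA"
optimized = recode(original, preferred)
```

Here `optimized` differs from `original` at the nucleotide level, while both translate to the same product, which is the property the paragraph above relies on.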

The nucleic acid sequences of the present invention can be engineered in
order to alter a variant product coding sequence for a variety of reasons,
including but not limited to, alterations which modify the cloning, processing
and/or expression of the product. For example, alterations may be introduced
using techniques which are well known in the art, e.g., site-directed mutagenesis,
to insert new restriction sites, to alter glycosylation patterns, to change codon
preference, etc.

The present invention also includes recombinant constructs comprising
one or more of the sequences as broadly described above. The constructs
comprise a vector, such as a plasmid or viral vector, into which a nucleic acid
sequence of the invention has been inserted, in a forward or reverse orientation.
In a preferred aspect of this embodiment, the construct further comprises
regulatory sequences, including, for example, a promoter, operably linked to the
sequence. Large numbers of suitable vectors and promoters are known to those
of skill in the art, and are commercially available. Appropriate cloning and
expression vectors for use with prokaryotic and eukaryotic hosts are also
described in Sambrook, et al. (supra).

The present invention also relates to host cells which are genetically
engineered with vectors of the invention, and the production of the product of the
invention by recombinant techniques. Host cells are genetically engineered (i.e.,
transduced, transformed or transfected) with the vectors of this invention which
may be, for example, a cloning vector or an expression vector. The vector may
be, for example, in the form of a plasmid, a viral particle, a phage, etc. The
engineered host cells can be cultured in conventional nutrient media modified as
appropriate for activating promoters, selecting transformants or amplifying the
expression of the variant nucleic acid sequence. The culture conditions, such as
temperature, pH and the like, are those previously used with the host cell selected
for expression, and will be apparent to those skilled in the art.

The nucleic acid sequences of the present invention may be included in
any one of a variety of expression vectors for expressing a product. Such vectors
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g.,
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast
plasmids; vectors derived from combinations of plasmids and phage DNA; and
viral DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However,
any other vector may be used as long as it is replicable and viable in the host.
The appropriate DNA sequence may be inserted into the vector by a variety of
procedures. In general, the DNA sequence is inserted into an appropriate
restriction endonuclease site(s) by procedures known in the art. Such procedures
and related sub-cloning procedures are deemed to be within the scope of those
skilled in the art.

The DNA sequence in the expression vector is operatively linked to an
appropriate transcription control sequence (promoter) to direct mRNA synthesis.
Examples of such promoters include: LTR or SV40 promoter, the E.coli lac or
trp promoter, the phage lambda PL promoter, and other promoters known to
control expression of genes in prokaryotic or eukaryotic cells or their viruses.
The expression vector also contains a ribosome binding site for translation
initiation, and a transcription terminator. The vector may also include
appropriate sequences for amplifying expression. In addition, the expression
vectors preferably contain one or more selectable marker genes to provide a
phenotypic trait for selection of transformed host cells such as dihydrofolate
reductase or neomycin resistance for eukaryotic cell culture, or such as
tetracycline or ampicillin resistance in E.coli.

The vector containing the appropriate DNA sequence as described above,
as well as an appropriate promoter or control sequence, may be employed to
transform an appropriate host to permit the host to express the protein. Examples
of appropriate expression hosts include: bacterial cells, such as E.coli,
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells
such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK
293 or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an
appropriate host is deemed to be within the scope of those skilled in the art from
the teachings herein. The invention is not limited by the host cells employed.

In bacterial systems, a number of expression vectors may be selected
depending upon the use intended for the variant product. For example, when
large quantities of variant product are needed for the induction of antibodies,
vectors which direct high level expression of fusion proteins that are readily
purified may be desirable. Such vectors include, but are not limited to,
multifunctional E.coli cloning and expression vectors such as Bluescript(R)
(Stratagene), in which the variant polypeptide coding sequence may be ligated
into the vector in-frame with sequences for the amino-terminal Met and the
subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced;
pIN vectors (Van Heeke & Schuster J. Biol Chem. 264:5503-5509, (1989)); pET
vectors (Novagen, Madison WI); and the like.
In the yeast Saccharomyces cerevisiae a number of vectors containing
constitutive or inducible promoters such as alpha factor, alcohol oxidase and
PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al.
(Methods in Enzymology 153:516-544, (1987)).

In cases where plant expression vectors are used, the expression of a
sequence encoding variant product may be driven by any of a number of
promoters. For example, viral promoters such as the 35S and 19S promoters of
CaMV (Brisson et al., Nature 310:511-514, (1984)) may be used alone or in
combination with the omega leader sequence from TMV (Takamatsu et al.,
EMBO J., 6:307-311, (1987)). Alternatively, plant promoters such as the small
subunit of RUBISCO (Coruzzi et al., EMBO J. 3:1671-1680, (1984); Broglie et
al., Science 224:838-843, (1984)); or heat shock promoters (Winter J and
Sinibaldi R.M., Results Probl Cell Differ., 17:85-105, (1991)) may be used.
These constructs can be introduced into plant cells by direct DNA transformation
or pathogen-mediated transfection. For reviews of such techniques, see Hobbs S.
or Murry L.E. (1992) in McGraw Hill Yearbook of Science and Technology,
McGraw Hill, New York, N.Y., pp 191-196; or Weissbach and Weissbach (1988)
Methods for Plant Molecular Biology, Academic Press, New York, N.Y., pp
421-463.

Variant product may also be expressed in an insect system. In one such
system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia
larvae. The variant product coding sequence may be cloned into a nonessential
region of the virus, such as the polyhedrin gene, and placed under control of the
polyhedrin promoter. Successful insertion of variant coding sequence will render
the polyhedrin gene inactive and produce recombinant virus lacking coat
protein. The recombinant viruses are then used to infect S. frugiperda cells or
Trichoplusia larvae in which variant protein is expressed (Smith et al., J. Virol.
46:584, (1983); Engelhard, E.K. et al., Proc. Nat. Acad. Sci. 91:3224-7, (1994)).
In mammalian host cells, a number of viral-based expression systems may
be utilized. In cases where an adenovirus is used as an expression vector, a
variant product coding sequence may be ligated into an adenovirus
transcription/translation complex consisting of the late promoter and tripartite
leader sequence. Insertion in a nonessential E1 or E3 region of the viral genome
will result in a viable virus capable of expressing variant protein in infected host
cells (Logan and Shenk, Proc. Natl. Acad. Sci. 81:3655-59, (1984)). In addition,
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be
used to increase expression in mammalian host cells.

Specific initiation signals may also be required for efficient translation of
a variant product coding sequence. These signals include the ATG initiation
codon and adjacent sequences. In cases where variant product coding sequence,
its initiation codon and upstream sequences are inserted into the appropriate
expression vector, no additional translational control signals may be needed.
However, in cases where only coding sequence, or a portion thereof, is inserted,
exogenous translational control signals including the ATG initiation codon
must be provided. Furthermore, the initiation codon must be in the correct
reading frame to ensure translation of the entire insert. Exogenous
transcriptional elements and initiation codons can be of various origins, both
natural and synthetic. The efficiency of expression may be enhanced by the
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et al.,
Results Probl Cell Differ., 20:125-62, (1994); Bittner et al., Methods in
Enzymol 153:516-544, (1987)).
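
The requirement that the initiation codon be in the correct reading frame can be illustrated by reading a construct in triplets from its first ATG until the first in-frame stop codon. In the Python sketch below, a single inserted base shifts the frame so that the intended stop codon is no longer read. The function name and the toy constructs are assumptions made for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_reading_frame(construct: str):
    """Return (start, end) of the frame from the first ATG through the first
    in-frame stop codon (end is exclusive), or None if no such frame exists."""
    seq = construct.upper()
    start = seq.find("ATG")
    if start < 0:
        return None
    # Step through complete codons only, beginning at the initiation codon.
    for i in range(start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return start, i + 3
    return None


# Toy constructs (hypothetical): the second carries one extra base after the ATG.
in_frame = open_reading_frame("CCATGGCTGCTTAAGG")
shifted = open_reading_frame("CCATGAGCTGCTTAAGG")
```

In the first construct the frame runs from the ATG through the TAA stop; in the second, the extra base moves TAA out of frame and no stop is found within the sequence.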

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation
(Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in Molecular
Biology). Cell-free translation systems can also be employed to produce
polypeptides using RNAs derived from the DNA constructs of the present
invention.

A host cell strain may be chosen for its ability to modulate the expression
of the inserted sequences or to process the expressed protein in the desired
fashion. Such modifications of the protein include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and
acylation. Post-translational processing which cleaves a "pre-pro" form of the
protein may also be important for correct insertion, folding and/or function.
Different host cells such as CHO, HeLa, MDCK, 293, WI38, etc. have specific
cellular machinery and characteristic mechanisms for such post-translational
activities and may be chosen to ensure the correct modification and processing of
the introduced, foreign protein.

For long-term, high-yield production of recombinant proteins, stable
expression is preferred. For example, cell lines which stably express variant
product may be transformed using expression vectors which contain viral origins
of replication or endogenous expression elements and a selectable marker gene.
Following the introduction of the vector, cells may be allowed to grow for 1-2
days in an enriched media before they are switched to selective media. The
purpose of the selectable marker is to confer resistance to selection, and its
presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clumps of stably transformed cells can be
proliferated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell
lines. These include, but are not limited to, the herpes simplex virus thymidine
kinase (Wigler M., et al., Cell 11:223-32, (1977)) and adenine
phosphoribosyltransferase (Lowy I., et al., Cell 22:817-23, (1980)) genes which
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite,
antibiotic or herbicide resistance can be used as the basis for selection; for
example, dhfr, which confers resistance to methotrexate (Wigler M., et al., Proc.
Natl. Acad. Sci. 77:3567-70, (1980)); npt, which confers resistance to the
aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et al., J. Mol. Biol.,
150:1-14, (1981)); and als and pat (phosphinothricin acetyltransferase), which
confer resistance to chlorsulfuron and phosphinothricin, respectively (Murry,
supra). Additional selectable genes have been described, for example, trpB,
which allows cells to utilize indole in place of tryptophan, or hisD, which
allows cells to utilize histidinol in place of histidine (Hartman S.C. and R.C.
Mulligan, Proc. Natl. Acad. Sci. 85:8047-51, (1988)). The use of visible markers
has gained popularity with such markers as anthocyanins, beta-glucuronidase
(GUS) and its substrate, and luciferase and its substrates, luciferin and ATP,
being widely used not only to identify transformants, but also to quantify the
amount of transient or stable protein expression attributable to a specific
vector system (Rhodes, C.A. et al., Methods Mol. Biol., 55:121-131, (1995)).

Host cells transformed with a nucleotide sequence encoding variant
product may be cultured under conditions suitable for the expression and
recovery of the encoded protein from cell culture. The product produced by a
recombinant cell may be secreted or contained intracellularly depending on the
sequence and/or the vector used. As will be understood by those of skill in the
art, expression vectors containing nucleic acid sequences encoding variant
product can be designed with signal sequences which direct secretion of variant
product through a prokaryotic or eukaryotic cell membrane.

The variant product may also be expressed as a recombinant protein with
one or more additional polypeptide domains added to facilitate protein
purification. Such purification facilitating domains include, but are not limited
to, metal chelating peptides such as histidine-tryptophan modules that allow
purification on immobilized metals, protein A domains that allow purification on
immobilized immunoglobulin, and the domain utilized in the FLAGS
extension/affinity purification system (Immunex Corp, Seattle, Wash.). The
inclusion of a protease-cleavable polypeptide linker sequence between the
purification domain and variant product is useful to facilitate purification. One
such expression vector provides for expression of a fusion protein comprising
a variant polypeptide fused to a polyhistidine region separated by an enterokinase
cleavage site. The histidine residues facilitate purification on IMAC
(immobilized metal ion affinity chromatography, as described in Porath, et al.,
Protein Expression and Purification, 3:263-281, (1992)) while the enterokinase
cleavage site provides a means for isolating variant polypeptide from the fusion
protein. pGEX vectors (Promega, Madison, Wis.) may also be used to express
foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In
general, such fusion proteins are soluble and can easily be purified from lysed
cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case
of GST-fusions) followed by elution in the presence of free ligand.

Following transformation of a suitable host strain and growth of the host
strain to an appropriate cell density, the selected promoter is induced by
appropriate means (e.g., temperature shift or chemical induction) and cells are
cultured for an additional period. Cells are typically harvested by centrifugation,
disrupted by physical or chemical means, and the resulting crude extract retained
for further purification. Microbial cells employed in expression of proteins can
be disrupted by any convenient method, including freeze-thaw cycling,
sonication, mechanical disruption, use of cell lysing agents, or other methods
which are well known to those skilled in the art.

The variant products can be recovered and purified from recombinant cell
cultures by any of a number of methods well known in the art, including
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation
exchange chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, affinity chromatography, hydroxylapatite
chromatography, and lectin chromatography. Protein refolding steps can be used,
as necessary, in completing configuration of the mature protein. Finally, high
performance liquid chromatography (HPLC) can be employed for final
purification steps.

C. Diagnostic applications utilizing nucleic acid sequences

The nucleic acid sequences of the present invention may be used for a
variety of diagnostic purposes. The nucleic acid sequences may be used to detect
and quantitate expression of the variant in a patient's cells, e.g. biopsied
tissues, by detecting the presence of mRNA coding for variant product.
Alternatively, the assay may be used to detect soluble variant in the serum or
blood. This assay typically involves obtaining total mRNA from the tissue or
serum and contacting the mRNA with a nucleic acid probe. The probe is a nucleic
acid molecule of at least 20 nucleotides, preferably 20-30 nucleotides, capable
of specifically hybridizing, under hybridizing conditions, with a sequence
included within the sequence of a nucleic acid molecule encoding variant
product. The presence of mRNA hybridized to the probe is then detected, thereby
detecting the expression of variant. This assay can be used to distinguish
between absence, presence, and excess expression of variant product and to
monitor levels of variant expression during therapeutic intervention. In
addition, the assay may be used to compare the levels of the variant of the
invention to the levels of the original sequence from which it has been varied
or to levels of other variants, which comparison may have some physiological
meaning.
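
The probe requirement described above (a nucleic acid of 20-30 nucleotides that
specifically hybridizes with the variant mRNA) can be illustrated with a minimal
Python sketch. The function names and the perfect-match hybridization model are
assumptions made only for illustration; real probe selection would also weigh
melting temperature, secondary structure and cross-hybridization.

```python
# Illustrative sketch only; names and the perfect-match model are hypothetical.
RNA_TO_DNA = {"A": "T", "C": "G", "G": "C", "U": "A"}
DNA_TO_RNA = {"A": "U", "C": "G", "G": "C", "T": "A"}


def antisense_dna_probe(mrna_region):
    """Return the DNA probe (5'->3') that base-pairs with an mRNA region (5'->3')."""
    return "".join(RNA_TO_DNA[base] for base in reversed(mrna_region.upper()))


def candidate_probes(mrna, min_len=20, max_len=30):
    """Yield (start, length, probe) for every mRNA window of min_len..max_len nt."""
    for length in range(min_len, max_len + 1):
        for start in range(len(mrna) - length + 1):
            yield start, length, antisense_dna_probe(mrna[start:start + length])


def hybridizes(probe, mrna):
    """Perfect-match model: True if the probe's exact complement occurs in the mRNA."""
    target = "".join(DNA_TO_RNA[base] for base in reversed(probe.upper()))
    return target in mrna.upper()
```

Under this simplified model, a probe that spans a splice junction unique to the
variant would hybridize with the variant mRNA but not with the original
sequence, which is the distinction the assay relies on.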

The invention also contemplates the use of the nucleic acid sequences as a
diagnostic for diseases resulting from inherited defective variant sequences, or
diseases in which the ratio of the amount of the original sequence from which the
variant was varied to the novel variants of the invention is altered. These
sequences can be detected by comparing the sequences of the defective (i.e.,
mutant) variant coding region with that of a normal coding region. Association
of the sequence coding for mutant variant product with abnormal variant product
activity may be verified. In addition, sequences encoding mutant variant products
can be inserted into a suitable vector for expression in a functional assay system
(e.g., colorimetric assay, complementation experiments in a variant protein
deficient strain of HEK293 cells) as yet another means to verify or identify
mutations. Once mutant genes have been identified, one can then screen
populations of interest for carriers of the mutant gene.

Individuals carrying mutations in the nucleic acid sequence of the present
invention may be detected at the DNA level by a variety of techniques. Nucleic
acids used for diagnosis may be obtained from a patient's cells, including but not
limited to cells from blood, urine, saliva, placenta, tissue biopsy and autopsy
material. Genomic DNA may be used directly for detection or may be amplified
enzymatically by using PCR (Saiki, et al., Nature 324:163-166, (1986)) prior to
analysis. RNA or cDNA may also be used for the same purpose. As an example,
PCR primers complementary to the nucleic acid of the present invention can be
used to identify and analyze mutations in the gene of the present invention.
Deletions and insertions can be detected by a change in size of the amplified
product in comparison to the normal genotype.
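
The size comparison described above can be sketched in Python as a simple
in-silico amplification. The helper names, the exact-match primer model and the
example templates are hypothetical and serve only to show how an insertion or
deletion between the primer sites changes the length of the amplified product.

```python
# Illustrative sketch only; primers are assumed to match the template exactly.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_DNA_COMPLEMENT)[::-1]


def amplicon(template, fwd_primer, rev_primer):
    """Return the template region bounded by the two primers, or None if absent."""
    template = template.upper()
    start = template.find(fwd_primer.upper())
    if start < 0:
        return None
    # The reverse primer anneals to the opposite strand, so its site on this
    # strand reads as the primer's reverse complement.
    site = reverse_complement(rev_primer)
    end = template.find(site, start + len(fwd_primer))
    if end < 0:
        return None
    return template[start:end + len(site)]


def classify_size_change(normal, sample, fwd_primer, rev_primer):
    """Compare the sample amplicon length with that of the normal genotype."""
    ref = amplicon(normal, fwd_primer, rev_primer)
    obs = amplicon(sample, fwd_primer, rev_primer)
    if ref is None or obs is None:
        return "no product"
    delta = len(obs) - len(ref)
    if delta == 0:
        return "same size"
    if delta > 0:
        return f"insertion of {delta} bp"
    return f"deletion of {-delta} bp"
```

A result of "same size" does not exclude point mutations, which leave the
product length unchanged and call for the hybridization-based methods described
in the text.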

Point mutations can be identified by hybridizing amplified DNA to
radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA
sequences of the invention. Sequence changes at specific locations may also be
revealed by nuclease protection assays, such as RNase and S1 protection or the
chemical cleavage method (e.g. Cotton, et al., Proc. Natl. Acad. Sci. USA,
85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular
beacons" (Kostrikis L.G. et al., Science 279:1228-1229, (1998)), hairpin-shaped,
single-stranded synthetic oligonucleotides containing probe sequences which
are complementary to the nucleic acid of the present invention, may also be used
to detect point mutations or other sequence changes as well as monitor expression
levels of variant product. Such diagnostics would be particularly useful for
prenatal testing.

Another method for detecting mutations uses two DNA probes which are
designed to hybridize to adjacent regions of a target, with abutting bases, where
the region of known or suspected mutation(s) is at or near the abutting bases.
The two probes may be joined at the abutting bases, e.g., in the presence of a
ligase enzyme, but only if both probes are correctly base paired in the region of
probe junction. The presence or absence of mutations is then detectable by the
presence or absence of ligated probe.
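
The logic of this two-probe ligation method can be sketched in Python as
follows. The function names, the coordinate convention (the junction is given as
an index on the target strand) and the three-base pairing window are assumptions
chosen for illustration, not parameters taken from the method itself.

```python
# Illustrative sketch only; the 3-base window around the junction is an assumption.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_DNA_COMPLEMENT)[::-1]


def _pairing(probe, target_region):
    """Per-position pairing flags, listed in target orientation."""
    matched_target = reverse_complement(probe)  # target sequence the probe fits
    return [p == t for p, t in zip(matched_target, target_region.upper())]


def ligated(target, left_probe, right_probe, junction, window=3):
    """True if both probes are base paired within `window` bases of the junction.

    left_probe covers the target bases just before `junction`; right_probe
    covers the bases starting at `junction`.
    """
    left_start = junction - len(left_probe)
    right_end = junction + len(right_probe)
    if left_start < 0 or right_end > len(target):
        return False
    left_ok = _pairing(left_probe, target[left_start:junction])
    right_ok = _pairing(right_probe, target[junction:right_end])
    return all(left_ok[-window:]) and all(right_ok[:window])
```

In this model a ligated product indicates that the target matches the probes at
the junction, while a mutation at or near the abutting bases prevents ligation.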

Also suitable for detecting mutations in the variant product coding
sequence are oligonucleotide array methods based on sequencing by
hybridization (SBH), as described, for example, in U.S. Patent No. 5,547,839. In
a typical method, the DNA target analyte is hybridized with an array of
oligonucleotides formed on a microchip. The sequence of the target can then be
"read" from the pattern of target binding to the array.
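
How a sequence may be "read" from a binding pattern can be shown with a toy
Python sketch. It assumes an idealized array that reports exactly the set of
k-mers present in the target, and a target in which every overlap of k-1 bases
is unique; the function names are hypothetical and this is not the procedure of
the cited patent.

```python
# Toy sketch only; assumes error-free binding and no repeated (k-1)-mers.
def spectrum(target, k):
    """Return the set of k-mers an idealized array would report for the target."""
    return {target[i:i + k] for i in range(len(target) - k + 1)}


def read_sequence(kmers, k):
    """Rebuild the target by chaining k-mers that overlap by k-1 bases."""
    suffixes = {kmer[1:] for kmer in kmers}
    starts = [kmer for kmer in kmers if kmer[:-1] not in suffixes]
    if len(starts) != 1:
        raise ValueError("start of the sequence is ambiguous")
    sequence = starts[0]
    remaining = set(kmers) - {sequence}
    while remaining:
        overlap = sequence[-(k - 1):]
        nxt = [kmer for kmer in remaining if kmer[:-1] == overlap]
        if len(nxt) != 1:
            raise ValueError("extension is ambiguous or impossible")
        sequence += nxt[0][-1]
        remaining.remove(nxt[0])
    return sequence
```

A mutation in the target changes which k-mers bind, so comparing the observed
spectrum with that expected for the normal coding sequence reveals the change.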

D. Gene mapping utilizing nucleic acid sequences

The nucleic acid sequences of the present invention are also valuable for
chromosome identification. The sequence is specifically targeted to and can
hybridize with a particular location on an individual human chromosome.
Moreover, there is a current need for identifying particular sites on the
chromosome. Few chromosome marking reagents based on actual sequence data
(repeat polymorphisms) are presently available for marking chromosomal
location. The mapping of DNAs to chromosomes according to the present
invention is an important first step in correlating those sequences with genes
associated with disease.

Briefly, sequences can be mapped to chromosomes by preparing PCR
primers (preferably 20-30 bp) from the variant cDNA. Computer analysis of the
3' untranslated region is used to rapidly select primers that do not span more than
one exon in the genomic DNA, which would complicate the amplification
process. These primers are then used for PCR screening of somatic cell hybrids
containing individual human chromosomes. Only those hybrids containing the
human gene corresponding to the primer will yield an amplified fragment.

PCR mapping of somatic cell hybrids, or alternatively of radiation hybrids,
is a rapid procedure for assigning a particular DNA to a particular chromosome.
Using the present invention with the same oligonucleotide primers,
sublocalization can be achieved with panels of fragments from specific
chromosomes or pools of large genomic clones in an analogous manner. Other
mapping strategies that can similarly be used to map a sequence to its
chromosome include in situ hybridization, prescreening with labeled flow-sorted
chromosomes and preselection by hybridization to construct chromosome
specific-cDNA libraries.

Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase
chromosomal spread can be used to provide a precise chromosomal location in
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a
review of this technique, see Verma et al., Human Chromosomes: a Manual of
Basic Techniques, (1988) Pergamon Press, New York.

Once a sequence has been mapped to a precise chromosomal location, the
physical position of the sequence on the chromosome can be correlated with
genetic map data. Such data are found, for example, in the OMIM database
(Center for Medical Genetics, Johns Hopkins University, Baltimore, MD and
National Center for Biotechnology Information, National Library of Medicine,
Bethesda, MD). The OMIM gene map presents the cytogenetic map location of
disease genes and other expressed genes. The OMIM database provides
information on diseases associated with the chromosomal location. Such
associations include the results of linkage analysis mapped to this interval, and
the correlation of translocations and other chromosomal aberrations in this area
with the advent of polygenic diseases, such as cancer, in general and prostate
cancer in particular.

E. Therapeutic applications of nucleic acid sequences

Nucleic acid sequences of the invention may also be used for therapeutic
purposes. Turning first to the second aspect of the invention (i.e. inhibition of
expression of variant), expression of variant product may be modulated through
antisense technology, which controls gene expression through hybridization of
complementary nucleic acid sequences, i.e. antisense DNA or RNA, to the
control, 5' or regulatory regions of the gene encoding variant product. For
example, the 5' coding portion of the nucleic acid sequence which
codes for the product of the present invention is used to design an antisense
oligonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides
derived from the transcription start site, e.g. between positions -10 and +10 from
the start site, are preferred. An antisense DNA oligonucleotide is designed to be
complementary to a region of the nucleic acid sequence involved in transcription
(Lee et al., Nucl. Acids Res., 6:3073, (1979); Cooney et al., Science 241:456,
(1988); and Dervan et al., Science 251:1360, (1991)), thereby preventing
transcription and the production of the variant products. An antisense RNA
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the
mRNA molecule into the variant products (Okano, J. Neurochem. 56:560,
(1991)). The antisense constructs can be delivered to cells by procedures known
in the art such that the antisense RNA or DNA may be expressed in vivo. The
antisense may be antisense mRNA or DNA sequence capable of coding such
antisense mRNA. The antisense mRNA or the DNA coding thereof can be
complementary to the full sequence of nucleic acid sequences coding for the
variant protein or to a fragment of such a sequence which is sufficient to inhibit
production of a protein product.

Turning now to the first aspect of the invention, i.e. expression of variant,
expression of variant product may be increased by providing coding sequences
for said product under the control of suitable control elements enabling
its expression in the desired host.

The nucleic acid sequences of the invention may be employed in
combination with a suitable pharmaceutical carrier. Such compositions comprise
a therapeutically effective amount of the compound, and a pharmaceutically
acceptable carrier or excipient. Such a carrier includes but is not limited to saline,
buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The
formulation should suit the mode of administration.

The products of the invention, as well as any activator and deactivator
compounds (see below) which are polypeptides, may also be employed in
accordance with the present invention by expression of such polypeptides in vivo,
which is often referred to as "gene therapy." Cells from a patient may be
engineered with a nucleic acid sequence (DNA or RNA) encoding a polypeptide
ex vivo, with the engineered cells then being provided to a patient to be treated
with the polypeptide. Such methods are well-known in the art. For example, cells
may be engineered by procedures known in the art by use of a retroviral particle
containing RNA encoding a polypeptide of the present invention.

Similarly, cells may be engineered in vivo for expression of a polypeptide
in vivo by procedures known in the art. As known in the art, a producer cell for
producing a retroviral particle containing RNA encoding the polypeptide of the
present invention may be administered to a patient for engineering cells in vivo
and expression of the polypeptide in vivo. These and other methods for
administering a product of the present invention by such method should be
apparent to those skilled in the art from the teachings of the present invention.
For example, the expression vehicle for engineering cells may be other than a
retrovirus, for example, an adenovirus which may be used to engineer cells in
vivo after combination with a suitable delivery vehicle.

Retroviruses from which the retroviral plasmid vectors mentioned above
may be derived include, but are not limited to, Moloney Murine Leukemia Virus,
spleen necrosis virus, retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency
virus, adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor virus.

The retroviral plasmid vector is employed to transduce packaging cell
lines to form producer cell lines. Examples of packaging cells which may be
transfected include, but are not limited to, the PE501, PA317, psi-2, psi-AM,
PA12, T19-14X, VT-19-17-H2, psi-CRE, psi-CRIP, GP+E-86, GP+envAm12,
and DAN cell lines as described in Miller (Human Gene Therapy, Vol. 1,
pg. 5-14, (1990)). The vector may transduce the packaging cells through any
means known in the art. Such means include, but are not limited to,
electroporation, the use of liposomes, and CaPO4 precipitation. In one alternative,
the retroviral plasmid vector may be encapsulated into a liposome, or coupled to a
lipid, and then administered to a host.

The producer cell line generates infectious retroviral vector particles
which include the nucleic acid sequence(s) encoding the polypeptides. Such
retroviral vector particles then may be employed to transduce eukaryotic cells,
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic
acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be
transduced include, but are not limited to, embryonic stem cells, embryonic
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts,
myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells.

The genes introduced into cells may be placed under the control of
inducible promoters, such as the radiation-inducible Egr-1 promoter (Maceri,
H.J., et al., Cancer Res., 56(19):4311 (1996)), to stimulate variant production or
antisense inhibition in response to radiation, e.g., radiation therapy for treating
tumors.

Example III. Variant product 

The substantially purified variant product of the invention has been
defined above as the product coded from the nucleic acid sequence of the
invention. Preferably the amino acid sequence is an amino acid sequence having
at least 90% identity to any one of the sequences identified as SEQ ID NO: 35 to
SEQ ID NO: 68 provided that the amino acid sequence is not identical to that of
the original sequence from which it has been varied. The protein or polypeptide
may be in mature and/or modified form, also as defined above. Also
contemplated are protein fragments having at least 10 contiguous amino acid
residues, preferably at least 10-20 residues, derived from the variant product, as
well as homologues as explained above.
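
The two numerical criteria in the preceding paragraph (at least 90% identity
while not being identical to the original sequence, and fragments of at least 10
contiguous residues) can be illustrated with a minimal Python sketch. It assumes
the two sequences have already been aligned to equal length, with "-" marking
gaps, and it takes the alignment length as the denominator; both choices, like
the function names, are assumptions for illustration and the identity
calculation defined earlier in the specification governs.

```python
# Illustrative sketch only; assumes pre-aligned, equal-length sequences.
def percent_identity(aligned_a, aligned_b):
    """Percentage of alignment columns holding the same residue (gaps never match)."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def qualifies(candidate, listed_variant, original, threshold=90.0):
    """True if candidate is >= threshold identical to the listed variant sequence
    and is not identical to the original sequence it was varied from."""
    return (percent_identity(candidate, listed_variant) >= threshold
            and candidate != original)


def contiguous_fragments(sequence, length=10):
    """All fragments of `length` contiguous residues taken from the sequence."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]
```

For a 20-residue sequence, a single substitution gives 95% identity and passes
the 90% threshold, whereas three substitutions give 85% and do not.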

The sequence variations are preferably those that are considered conserved
substitutions, as defined above. Thus, for example, a protein with a sequence
having at least 90% sequence identity with any of the products identified as SEQ
ID NO: 35 to SEQ ID NO: 68, preferably by utilizing conserved substitutions as
defined above, is also part of the invention, provided that it is not identical to
the original peptide from which it has been varied. In a more specific
embodiment, the protein has or contains any one of the sequences identified as
SEQ ID NO: 35 to SEQ ID NO: 68. The variant product may be (i) one in which
one or more of the amino acid residues in a sequence listed above are substituted
with a conserved or non-conserved amino acid residue (preferably a conserved
amino acid residue), or (ii) one in which one or more of the amino acid residues
includes a substituent group, or (iii) one in which the variant product is fused
with another compound, such as a compound to increase the half-life of the
protein (for example, polyethylene glycol (PEG)), or a moiety which serves as
targeting means to direct the protein to its target tissue or target cell population
(such as an antibody), or (iv) one in which additional amino acids are fused to the
variant product. Such fragments, variants and derivatives are deemed to be within
the scope of those skilled in the art from the teachings herein.

A. Preparation of variant product 

Recombinant methods for producing and isolating the variant product, and 
fragments of the protein are described above. 

In addition to recombinant production, fragments and portions of variant
product may be produced by direct peptide synthesis using solid-phase techniques
(cf. Stewart et al., (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San
Francisco; Merrifield J., J. Am. Chem. Soc., 85:2149-2154, (1963)). In vitro
peptide synthesis may be performed using manual techniques or by automation.
Automated synthesis may be achieved, for example, using Applied Biosystems
431A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with
the instructions provided by the manufacturer. Fragments of variant product may
be chemically synthesized separately and combined using chemical methods to
produce the full length molecule.



II. Therapeutic uses and compositions utilizing the variant product

The variant product of the invention is generally useful in treating diseases
and disorders which are characterized by a lower than normal level of variant
expression, and/or diseases which can be cured or ameliorated by raising the level
of the variant product, even if the level is normal.

Variant products or fragments may be administered by any of a number of
routes and methods designed to provide a consistent and predictable
concentration of compound at the target organ or tissue. The product-containing
compositions may be administered alone or in combination with other agents,
such as stabilizing compounds, and/or in combination with other pharmaceutical
agents such as drugs or hormones.

Variant product-containing compositions may be administered by a
number of routes including, but not limited to, oral, intravenous, intramuscular,
transdermal, subcutaneous, topical, sublingual, or rectal means as well as by nasal
application. Variant product-containing compositions may also be administered
via liposomes. Such administration routes and appropriate formulations are
generally known to those of skill in the art.

The product can be given via intravenous or intraperitoneal injection.
Similarly, the product may be injected into other localized regions of the body.
The product may also be administered via nasal insufflation. Enteral
administration is also possible. For such administration, the product should be
formulated into an appropriate capsule or elixir for oral administration, or into
a suppository for rectal administration.

The foregoing exemplary administration modes will likely require that the
product be formulated into an appropriate carrier, such as ointments, gels, or
suppositories. Appropriate formulations are well known to persons skilled in the
art.

Dosage of the product will vary, depending upon the potency and
therapeutic index of the particular polypeptide selected.

A therapeutic composition for use in the treatment method can include the
product in a sterile injectable solution, the polypeptide in an oral delivery vehicle,
the product in an aerosol suitable for nasal administration, or the product in a
nebulized form, all prepared according to well known methods. Such
compositions comprise a therapeutically effective amount of the compound, and a
pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and
combinations thereof. The product of the invention may also be used to modulate
endothelial differentiation and proliferation as well as to modulate apoptosis
either ex vivo or in vitro, for example, in cell cultures.

Example IV. Screening methods for activators and deactivators (inhibitors) 

The present invention also includes an assay for identifying molecules,
such as synthetic drugs, antibodies, peptides, or other molecules, which have a
modulating effect on the activity of the variant product, e.g. activators or
deactivators of the variant product of the present invention. Such an assay
comprises the steps of providing a variant product encoded by the nucleic acid
sequences of the present invention, contacting the variant protein with one or
more candidate molecules to determine the candidate molecules' modulating
effect on the activity of the variant product, and selecting from the molecules a
candidate molecule capable of modulating variant product physiological
activity.

The variant product, its catalytic or immunogenic fragments or
oligopeptides thereof, can be used for screening therapeutic compounds in any of
a variety of drug screening techniques. The fragment employed in such a test
may be free in solution, affixed to a solid support, borne on a cell membrane or
located intracellularly. The formation of binding complexes, between variant
product and the agent being tested, may be measured. Alternatively, the activator
or deactivator may work by serving as agonist or antagonist, respectively, of the
variant receptor, binding entity or target site, and their effect may be determined
in connection with any of the above.

Another technique for drug screening, which provides for high throughput
screening of compounds having suitable binding affinity to the variant product,
is described in detail by Geysen in PCT Application WO
84/03564, published on Sep. 13, 1984. In summary, large numbers of different
small peptide test compounds are synthesized on a solid substrate, such as plastic
pins or some other surface. The peptide test compounds are reacted with the full
variant product or with fragments of variant product and washed. Bound variant
product is then detected by methods well known in the art. Substantially purified
variant product can also be coated directly onto plates for use in the
aforementioned drug screening techniques. Alternatively, non-neutralizing
antibodies can be used to capture the peptide and immobilize it on a solid
support.

Antibodies to the variant product, as described in Example V below, may
also be used in screening assays according to methods well known in the art. For
example, a "sandwich" assay may be performed, in which an anti-variant
antibody is affixed to a solid surface such as a microtiter plate and variant
product is added. Such an assay can be used to capture compounds which bind to
the variant product. Alternatively, such an assay may be used to measure the
ability of compounds to influence the binding of variant product to the
variant receptor, and then select those compounds which affect the binding.

Example V. Anti-variant antibodies 

A. Synthesis

In still another aspect of the invention, the purified variant product is used 
to produce anti-variant antibodies which have diagnostic and therapeutic uses 
related to the activity, distribution, and expression of the variant product. 

Antibodies to the variant product may be generated by methods well
known in the art. Such antibodies may include, but are not limited to, polyclonal,
monoclonal, chimeric, humanized, single chain, Fab fragments and fragments
produced by an Fab expression library. Antibodies which inhibit
dimer formation are especially preferred for therapeutic use.

A fragment of the variant product used for antibody induction does not
require biological activity but must have immunological activity; that is, the
protein fragment or oligopeptide must be antigenic. Peptides used to induce
specific antibodies may have an amino acid sequence consisting of at least five
amino acids, preferably at least 10 amino acids of the sequences specified in any
one of SEQ ID NO: 35 to SEQ ID NO: 68. Preferably they should mimic a
portion of the amino acid sequence of the natural protein and may contain the
entire amino acid sequence of a small, naturally occurring molecule. Short
stretches of variant protein amino acids may be fused with those of another
protein such as keyhole limpet hemocyanin and antibody produced against the
chimeric molecule. Procedures well known in the art can be used for the
production of antibodies to variant product.

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, etc. may be immunized by injection with variant product or any
portion, fragment or oligopeptide which retains immunogenic properties. 
Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to Freund's, 
mineral gels such as aluminum hydroxide, and surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants. 

Monoclonal antibodies to variant protein may be prepared using any
technique which provides for the production of antibody molecules by continuous
cell lines in culture. These include but are not limited to the hybridoma technique
originally described by Koehler and Milstein (Nature 256:495-497, (1975)), the
human B-cell hybridoma technique (Kosbor et al., Immunol. Today 4:72, (1983);
Cote et al., Proc. Natl. Acad. Sci. 80:2026-2030, (1983)) and the EBV-hybridoma
technique (Cole, et al., Mol. Cell Biol. 62:109-120, (1984)).

Techniques developed for the production of "chimeric antibodies", the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity can also be used 
(Morrison et al, Proc. Natl. Acad. Sci. 81:6851-6855, (1984); Neuberger et al, 
Nature 312:604-608, (1984); Takeda et al, Nature 314:452-454, (1985)). 
Alternatively, techniques described for the production of single chain antibodies 
(U.S. Pat. No. 4,946,778) can be adapted to produce single-chain antibodies 
specific for the variant protein. 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin libraries or 
panels of highly specific binding reagents as disclosed in Orlandi et al. (Proc. 
Natl. Acad. Sci. 86:3833-3837, (1989)), and Winter G and Milstein C, (Nature 
349:293-299, (1991)). 

Antibody fragments which contain specific binding sites for variant 
protein may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by pepsin digestion of 
the antibody molecule and the Fab fragments which can be generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity (Huse W.D. et al, Science 
246:1275-1281, (1989)). 



B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
specificities are well known in the art. Such immunoassays typically involve the 
formation of complexes between the variant product and its specific antibody and 
the measurement of complex formation. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two noninterfering 
epitopes on a specific variant product is preferred, but a competitive binding 
assay may also be employed. These assays are described in Maddox D.E., et al, 
(J. Exp. Med. 158:1211, (1983)). 
Antibodies which specifically bind variant product are useful for the 
diagnosis of conditions or diseases characterized by expression of the novel 
variant of the invention (where normally it is not expressed) or by over- or 
under-expression of the variant, as well as for detection of diseases in which 
the proportion between the amount of the variants of the invention and the 
original sequence from which they varied is altered. Alternatively, such 
antibodies may be used in assays to monitor patients being treated with variant 
product, its activators, or its deactivators. Diagnostic assays for variant 
protein include methods utilizing the antibody and a label to detect variant 
product in human body fluids or extracts of cells or tissues. The products and 
antibodies of the present invention may be used with or without modification. 
Frequently, the proteins and antibodies will be labeled by joining them, either 
covalently or noncovalently, with a reporter molecule. A wide variety of 
reporter molecules are known in the art. 

A variety of protocols for measuring the variant product, using either 
polyclonal or monoclonal antibodies specific for the respective protein, are 
known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), and fluorescence-activated cell sorting (FACS). As 
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on variant product is 
preferred, but a competitive binding assay may be employed. These assays are 
described, among other places, in Maddox, et al (supra). Such protocols provide 
a basis for diagnosing altered or abnormal levels of variant product expression. 
Normal or standard values for variant product expression are established by 
combining body fluids or cell extracts taken from normal subjects, preferably 
human, with antibody to variant product under conditions suitable for complex 
formation which are well known in the art. The amount of standard complex 
formation may be quantified by various methods, preferably by photometric 
methods. Then, standard values obtained from normal samples may be compared 
with values obtained from samples from subjects potentially affected by disease. 
Deviation between standard and subject values establishes the presence of a 
disease state. 
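
The comparison of standard and subject values described above can be sketched 
in Python as follows. This is a minimal illustration only: the function names, 
the choice of mean plus or minus two sample standard deviations as the normal 
range, and the absorbance readings are all assumptions made for the example 
and are not taken from the disclosed assays. 

```python
from statistics import mean, stdev


def reference_range(normal_values, k=2.0):
    """Return (low, high) as mean +/- k sample standard deviations.

    The k = 2 cut-off is an assumed convention, not part of the text.
    """
    m = mean(normal_values)
    s = stdev(normal_values)
    return m - k * s, m + k * s


def deviates(subject_value, normal_values, k=2.0):
    """True if the subject reading falls outside the normal range."""
    low, high = reference_range(normal_values, k)
    return subject_value < low or subject_value > high


# Hypothetical photometric readings from normal samples (arbitrary units).
normal = [0.48, 0.52, 0.50, 0.47, 0.53]

print(deviates(0.95, normal))  # reading well above the normal range
print(deviates(0.51, normal))  # reading inside the normal range
```

A reading flagged by `deviates` would correspond to over- or under-expression 
relative to the standard values; the cut-off used in practice would have to be 
established for each assay. 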

The antibody assays are useful to determine the level of variant product 
present in a body fluid sample, in order to determine whether it is being 
expressed at all, whether it is being overexpressed or underexpressed in the 
tissue, or as an indication of how levels of variant product are responding to 
drug treatment. 

By another aspect the invention concerns methods for determining the 
presence or level of various anti-variant antibodies in a biological sample 
obtained from patients, such as a blood or serum sample, using the variant 
product as an antigen. Determination of said antibodies may be indicative of a 
plurality of pathological conditions or diseases. 

C. Therapeutic uses of antibodies 

In addition to their diagnostic use, the antibodies may have therapeutic 
utility in blocking or decreasing the activity of the variant product in pathological 
conditions where a beneficial effect can be achieved by such a decrease. 

The antibody employed is preferably a humanized monoclonal antibody, 
or a human Mab produced by known globulin-gene library methods. The 
antibody is administered typically as a sterile solution by IV injection, although 
other parenteral routes may be suitable. Typically, the antibody is administered 
in an amount between about 1-15 mg/kg body weight of the subject. Treatment is 
continued, e.g., with dosing every 1-7 days, until a therapeutic improvement is 
seen. 
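
The per-kilogram range stated above translates into an absolute amount by 
simple multiplication. The short Python sketch below shows the arithmetic; the 
function name and the 70 kg body weight are assumed example values, and only 
the 1-15 mg/kg bounds come from the text. 

```python
def dose_range_mg(body_weight_kg, low_mg_per_kg=1.0, high_mg_per_kg=15.0):
    """Return the (low, high) amount in mg for a given body weight.

    The default bounds are the 1-15 mg/kg range stated in the text.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg


# Assumed example subject of 70 kg.
print(dose_range_mg(70))  # (70.0, 1050.0)
```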

Although the invention has been described with reference to specific 
methods and embodiments, it is appreciated that various modifications and 
changes may be made without departing from the invention. 

CLAIMS: 

1. An isolated nucleic acid sequence, of an alternative splicing variant, selected 
from the group consisting of: 

(i) the nucleic acid sequence depicted in any one of SEQ ID NO: 1 to 
SEQ ID NO: 34; 

(ii) nucleic acid sequences having at least 90% identity with the 
sequence of (i) with the proviso that each sequence is different than the original 
nucleic acid sequence from which the sequences of (i) have been varied by 
alternative splicing; and 

(iii) fragments of (i) or (ii) of at least 20 b.p., provided that said fragment 
contains a sequence which is not present, as a continuous stretch of nucleotides, in 
the original nucleic acid sequence from which the sequences of (i) have been 
varied by alternative splicing. 

2. An isolated nucleic acid sequence complementary to the nucleic acid 
sequence of Claim 1. 

3. An amino acid sequence selected from the group consisting of: 

(i) an amino acid sequence coded by the isolated nucleic acid sequence 
of alternative splice variants of Claim 1; and 

(ii) homologues of the amino acid sequences of (i) in which one or more 
amino acids has been added, deleted, replaced or chemically 
modified in the region or adjacent to the region where the amino 
acid sequence differs from the original amino acid sequence, coded 
by the original nucleic acid sequence from which the variant has been 
varied. 

4. An amino acid sequence according to Claim 3, as depicted in any one of 
SEQ ID NO: 35 to SEQ ID NO: 68. 

5. An isolated nucleic acid sequence coding for any one of the amino acid 
sequences of Claim 3 or 4. 

6. A purified antibody which binds specifically to any of the amino acid 
sequences of Claim 3 or 4. 



7. An expression vector comprising any one of the nucleic acid sequences of 
Claim 1 or 5 and control elements for the expression of the nucleic acid sequence in 
a suitable host. 

8. An expression vector comprising any one of the nucleic acid sequences of 
Claim 2, and control elements for the expression of the nucleic acid sequences in a 
suitable host. 

9. A host cell transfected by the expression vector of Claim 7 or 8. 

10. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) the expression vector of Claim 7; and 

(ii) any one of the amino acid sequences of Claim 3 or 4. 

11. A pharmaceutical composition according to Claim 10, for treatment of 
diseases which can be ameliorated or cured by raising the level of any one of the 
amino acid sequences depicted in SEQ ID NO: 35 to SEQ ID NO: 68. 

12. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) any one of the nucleic acid sequences of Claim 2; 

(ii) the expression vector of Claim 8; and 

(iii) the purified antibody of Claim 6. 

13. A pharmaceutical composition according to Claim 12, for treatment of 
diseases which can be ameliorated or cured by decreasing the level of any one of 
the amino acid sequences depicted in SEQ ID NO: 35 to SEQ ID NO: 68. 

14. A method for detecting a variant nucleic acid sequence in a biological 
sample, comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1 or 2; and 

(b) detecting said hybridization complex; 

wherein the presence of said hybridization complex correlates with the 
presence of a variant nucleic acid sequence in the said biological sample. 

15. A method for determining the level of variant nucleic acid sequences in a 
biological sample comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1 or 2; and 

(b) determining the amount of hybridization complexes and normalizing 
said amount to provide the level of the variant nucleic acid sequences in the 
sample. 

16. A method for determining the ratio between the level of variant of the 
nucleic acid sequence in a first biological sample and the level of the original 
sequence from which the variant has been varied by alternative splicing in a second 
biological sample comprising: 

(i) determining the level of the variant nucleic acid sequence in the first 
biological sample according to the method of Claim 15; 

(ii) determining the level of the original sequence in the second biological 
sample; and 

(iii) comparing the levels obtained in (i) and (ii) to give said ratio. 

17. A method according to Claim 16, wherein said first and said second 
biological samples are the same sample. 

18. A method according to any of Claims 14 to 17, wherein the nucleic acid 
material of said biological sample are mRNA transcripts. 

19. A method according to Claim 18, where the nucleic acid sequence is present 
in a nucleic acid chip. 

20. A method for identifying candidate compounds capable of binding to the 
variant product and modulating its activity, the method comprising: 

(i) providing any one of the amino acid sequences as defined in Claim 3 
or 4; 

(ii) contacting a candidate compound with said amino acid sequence; and 

(iii) determining the effect of said candidate compound on the biological 
activity of said protein or polypeptide and selecting those compounds which show a 
significant effect on said biological activity. 

21. A method according to Claim 20, wherein the compound is an activator and 
the measured effect is increase in the biological activity. 

22. A method according to Claim 20, wherein the compound is a deactivator 
and the effect is decrease in the biological activity. 

23. An activator of any one of the amino acid sequences of Claim 3 or 4. 

24. A deactivator of any one of the amino acid sequences of Claims 3 or 4. 

25. A method for detecting any one of the amino acid sequences of Claim 3 or 4 
in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 6, 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex; 

wherein the presence of said antibody-antigen complex correlates with the 
presence of the desired amino acid sequence in said biological sample. 

26. A method for detecting the level of the amino acid sequence of any one of 
Claim 3 or 4 in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 6, 
thereby forming an antibody-antigen complex; and 

(b) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the level of said amino acid sequence in the 
sample. 

27. A method for determining the ratio between the level of any one of the 
amino acid sequences of Claims 3 or 4 present in a first biological sample and the 
level of the original amino acid sequences from which they were varied by 
alternative splicing, present in a second biological sample, the method comprising: 

(i) determining the level of the amino acid sequences of Claims 3 or 4 
in the first sample by the method of Claim 26; 

(ii) determining the level of the original amino acid sequence in the 
second sample; and 

(iii) comparing the levels obtained in (i) and (ii) to give said ratio. 

28. A method according to Claim 27, wherein said first and said second 
biological samples are the same sample. 

29. A method for detecting any one of the antibodies of Claim 6 in a biological 
sample comprising the steps of: 



(a) contacting said biological sample with any one of the amino acid 
sequences of Claim 3 or 4 thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of the antibody in said biological sample. 

30. A method for detecting the level of any one of the antibodies of Claim 6 in a 
biological sample comprising the steps of: 

(i) contacting said biological sample with any one of the amino acid 
sequences of Claim 3; and 

(ii) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the levels of said antibody in the sample. 
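
The fragment condition of Claim 1(iii) can be illustrated with a short Python 
sketch. It is an assumption-laden toy example, not the applicants' method: the 
function name is hypothetical, the three "exons" are invented sequences, and 
the check ignores the complementary strand and the 90% identity variants of 
Claim 1(ii). It relies on the observation that a fragment contains a stretch 
absent from the original sequence exactly when the fragment as a whole is not 
a continuous stretch of the original. 

```python
def is_qualifying_fragment(fragment, variant, original, min_len=20):
    """Check a candidate fragment against the two conditions of Claim 1(iii).

    The fragment must be at least min_len nucleotides long, must occur in the
    variant, and must not occur as a continuous stretch in the original.
    """
    fragment = fragment.lower()
    variant = variant.lower()
    original = original.lower()
    if len(fragment) < min_len:
        return False
    if fragment not in variant:
        return False
    # If the whole fragment is absent from the original, it necessarily
    # contains a sequence that is not present there as a continuous stretch.
    return fragment not in original


# Invented toy exons (20 nt each); the variant skips exon 2.
exon1 = "atggctagctaggatccgat"
exon2 = "ccgggtttaaacccgggttt"
exon3 = "tgcatgcaagcttggcactg"
original = exon1 + exon2 + exon3
variant = exon1 + exon3

# A 20 nt fragment spanning the new exon 1 / exon 3 junction.
junction = exon1[-10:] + exon3[:10]

print(is_qualifying_fragment(junction, variant, original))  # spans the junction
print(is_qualifying_fragment(exon1, variant, original))     # shared with original
```

In this toy case only the junction-spanning fragment qualifies, because exon 1 
on its own is also a continuous stretch of the original sequence. 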

For the Applicants, 

REINHOLD COHN AND PARTNERS 

SEQUENCE LISTING 



<110> COMPUGEN LTD. 

<120> SPLICE VARIANTS OF ONCOGENES 
<130> 1231190 - COMPUGEN 



<140> 
<141> 

<160> 68 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 1735 
<212> DNA 
<213> Humanus 

<400> 1 



gtccaccgga agcgagttgc gacacggcag gttcccgccc ggaagaagcg accaaagcgc 60 
ctgaggaccg gcaacatggt gcggtcgggg aataaggcag ctgttgtgct gtgtatggac 120 
gtgggcttta ccatgagtaa ctccattcct ggtatagaat ccccatttga acaagcaaag 180 
aaggtgataa ccatgtttgt acagcgacag gtgtttgctg agaacaagga tgagattgct 240 
ttagtcctgt ttggtacaga tggcactgac aatccccttt ctggtgggga tcagtatcag 300 
aacatcacag tgcacagaca tctgatgcta ccagattttg atttgctgga ggacattgaa 360 
agcaaaatcc aaccaggttc tcaacaggct gacttcctgg atgcactaat cgtgagcatg 420 
gatgtgattc aacatgaaac aataggaaag aagtttgaga agaggcatat tgaaatattc 480 
actgacctca gcagccgatt cagcaaaagt cagctggata ttataattca tagcttgaag 540 
aaatgtgaca tctccctgca attcttcttg cctttctcac ttggcaagga agatggaagt 600 
ggggacagag gagatggccc ctttcgctta ggtggccatg ggccttcctt tccactaaaa 660 
ggaattaccg aacagcaaaa agaaggtctt gagatagtga aaatggtgat gatatcttta 720 
gaaggtgaag atgggttgga tgaaatttat tcattcagtg agagtctgag aaaactgtgc 780 
gtcttcaaga aaattgagag gcattccatt cactggccct gccgactgac cattggctcc 840 
aatttgtcta taaggattgc agcctataaa tcgattctac aggagagagt taaaaagact 900 
tggacagttg tggatgcaaa aaccctaaaa aaagaagata tacaaaaaga aacagtttat 960 
tgcttaaatg atgatgatga aactgaagtt ttaaaagagg atattattca agggttccgc 1020 
tatggaagtg atatagttcc tttctctaaa gtggatgagg aacaaatgaa atataaatcg 1080 
gaggggaagt gcttctctgt tttgggattt tgtaaatctt ctcaggttca gagaagattc 1140 
Ht^tt ga t atcaagttct aaaggtcttt gcagcaagag atgatgaggc agctgcagtt 1200 
gcactttcct ccctgattca tgctttggat gacttagaca tggtggccat agttcgaLt 1260 
lltl^tt aaa ? a ? Ctaa tcctcaagtc ggcgtggctt ttcctcatat caagcataac 1320 
tatgagtgtt tagtgtatgt gcagctgcct ttcatggaag acttgcggca atacatgttt 1380 
Zltlt !r aaaa " gtaa gaaatatgct cccacogagg cacagttgaa tgctgttgat 1440 
ttltlt 9 actccat 9 a g cttggcaaag aaagatgaga agacagacac ccttgaagac 1500 
ttgtttccaa ccaccaaaat cccaaatcct cgatttcaga gattatttca ggtaagagaa 1560 
gaaggatgaa caagtcatat ttcttttaaa tgaaagagag ctaagtgcaa agttgcggta 1620 
attggccagt cctaaataaa tgtctttttt ctctgaaggc cccatttgct ctatttaatg 1680 
gaagaataac atcttcctct cctacttcag aaggtaaaaa tattctgagc tctta 1735 



<210> 2 
<211> 2766 
<212> DNA 
<213> Humanus 

<400> 2 

gtccaccgga agcgagttgc gacacggcag gttcccgccc ggaagaagcg accaaagcgc 60 
ctgaggaccg gcaacatggt gcggtcgggg aataaggcag ctgttgtgct gtgtatggac 120 
gtgggcttta ccatgagtaa ctccattcct ggtatagaat ccccatttga acaagcaaag 180 
aaggtgataa ccatgtttgt acagcgacag gtgtttgctg agaacaagga tgagattgct 240 
ttagtcctgt ttggtacaga tggcactgac aatccccttt ctggtgggga tcagtatcag 300 
aacatcacag tgcacagaca tctgatgcta ccagattttg atttgctgga ggacattgaa 360 
agcaaaatcc aaccaggttc tcaacaggct gacttcctgg atgcactaat cgtgagcatg 420 
gatgtgattc aacatgaaac aataggaaag aagtttgaga agaggcatat tgaaatattc 480 
actgacctca gcagccgatt cagcaaaagt cagctggata ttataattca tagcttgaag 540 
aaatgtgaca tctccctgca attcttcttg cctttctcac ttggcaagga agatggaagt 600 
ggggacagag gagatggccc ctttcgctta ggtggccatg ggccttcctt tccactaaaa 660 
ggaattaccg aacagcaaaa agaaggtctt gagatagtga aaatggtgat gatatcttta 720 
gaaggtgaag atgggttgga tgaaatttat tcattcagtg agagtctgag aaaactgtgc 780 
gtcttcaaga aaattgagag gcattccatt cactggccct gccgactgac cattggctcc 840 
aatttgtcta taaggattgc agcctataaa tcgattctac aggagagagt taaaaagact 900 
tggacagttg tggatgcaaa aaccctaaaa aaagaagata tacaaaaaga aacagtttat 960 
tgcttaaatg atgatgatga aactgaactg aatcctcccg ctgaggtgac aacgaaaagt 1020 
cagattcctc tctctaaaat aaagaccctt tttcctctga ttgaagccaa gaaaaaggat 1080 
caagtgactg ctcaggaaat tttccaagac aaccatgaag atggacctac agctaaaaaa 1140 
ttaaagactg agcaaggggg agcccacttc agcgtctcca gtctggctga aggcagtgtc 1200 
acctctgttg gaagtgtgaa tcctgctgaa aacttccgtg ttctagtgaa acagaagaag 1260 
gccagctttg aggaagcgag taaccagctc ataaatcaca tcgaacagtt tttggatact 1320 
aatgaaacac cgtattttat gaagagcata gactgcatcc gagccttccg ggaagaagcc 1380 
attaagtttt cagaagagca gcgctttaac aacttcctga aagcccttca agagaaagtg 1440 
gaaattaaac aattaaatca tttctgggaa attgttgtcc aggatggaat tactctgatc 1500 
accaaagagg aagcctctgg aagttctgtc acagctgagg aagccaaaaa gtttctggcc 1560 
cccaaagaca aaccaagtgg agacacagca gctgtatttg aagaaggtgg tgatgtggac 1620 
gatttattgg acatgatata ggtcgtggat gtatggggaa tctaagagag ctgccatcgc 1680 
tgtgatgctg ggagttctaa caaaacaagt tggatgcggc cattcaaggg gagccaaaat 1740 
ctcaagaaat tcccagcagg ttacctggag gcggatcatc taattctctg tggaatgaat 1800 
acacacatat atattacaag ggataattta gaccccatac aagtttataa agagtcattg 1860 
ttattttctg gttggtgtat tattttttct gtggtcttac tgatctttgt atattacata 1920 
catgctttga agtttctgga aagtagatct tttcttgacc tagtatatca gtgacagttg 1980 
cagcccttgt gatgtgatta gtgtctcatg tggaaccatg gcatggttat tgatgagttt 2040 
cttaaccctt tccagagtcc tcctttgcct gatcctccaa cagctgtcac aacttgtgtt 2100 
gagcaagcag tagcatttgc ttcctcccaa caagcagctg ggttaggaaa accatgggta 2160 
aggacggact cacttctctt tttagttgag gccttctagt taccacatta ctctgcctct 2220 
gtatataggt ggttttcttt aagtggggtg ggaaggggag cacaatttcc cttcatactc 2280 
cttttaagca gtgagttatg gtggtggtct catgaagaaa agaccttttg gcccaatctc 2340 
tgccatatca gtgaaccttt agaaactcaa aaactgagaa atttactaca gtagttagaa 2400 
ttatatcact tcactgttct ctacttgcaa gcctcaaaga gagaaagttt cgttatatta 2460 
aaacacttag gtaacttttc ggtctttccc atttctacct aagtcagctt tcatctttgt 2520 
ggatggtgtc tcctttacta aataagaaaa taacaaagcc cttattctct ttttttcttg 2580 
tcctcattct tgccttgagt tccagttcct ctttggtgta cagacttctt ggtacccagt 2640 
cacctctgtc ttcagcaccc tcataagtcg tcactaatac acagttttgt acatgtaaca 2700 
ttaaaggcat aaatgactca tcaaaaaaaa aaaaaaaaaa aattatggta cctcgaatca 2760 
atgtgt 2766 



<210> 3 
<211> 2034 
<212> DNA 
<213> Humanus 



<400> 3 

ggagacaggt ggtggctacg acggcraagg gagctgagac tgtccaggca gccaggttag 60 
gccaggagga ccatgtgaat ggggccagag ggctcccggg ctgggcaggg accatgggct 120 
gtggctgcag ctcacacccg gaagatgact ggatggaaaa catcgatgtg tgtgagaact 180 
gccattatcc catagtccca ctggatggca agggcacgct gctcatccga aatggctctg 240 
aggtgcggga cccactggtt acctacgaag gctccaatcc gccggcttcc ccactgcaag 300 
acaacctggt tatcgctctg cacagctatg agccctctca cgacggagat ctgggctttg 360 
agaaggggga acagctccgc atcctggagc agagcggcga gtggtggaag gcgcagtccc 420 
tgaccacggg ccaggaaggc ttcatcccct tcaattttgt ggccaaagcg aacagcctgg 480 
agcccgaacc ctggttcttc aagaacctga gccgcaagga cgcggagcgg cagctcctgg 540 
cgcccgggaa cactcacggc tccttcctca tccgggagag cgagagcacc gcgggatcgt 600 
tttcactgtc ggtccgggac ttcgaccaga accagggaga ggtggtgaaa cattacaaga 660 
tccgtaatct ggacaacggt ggcttctaca tctcccctcg aatcactttt cccggcctgc 720 
atgaactggt ccgccattac accaatgctt cagatgggct gtgcacacgg ttgagccgcc 780 
cctgccagac ccagaagccc cagaagccgt ggtgggagga cgagtgggag gttcccaggg 840 
agacgctgaa gctggtggag cggctggggg ctggacagtt cggggaggtg tggatggggt 900 
actacaacgg gcacacgaag gtggcggtga agagcctgaa gcagggcagc atgtccccgg 960 
acgccttcct ggccgaggcc aacctcatga agcagctgca acaccagcgg ctggttcggc 1020 
tctacgctgt ggtcacccag gagcccatct acatcatcac tgaatacatg gagaatggga 1080 
gtctagtgga ttttctcaag accccttcag gcatcaagtt gaccatcaac aaactcctgg 1140 
acatggcagc ccaaattgca gaaggcatgg cattcattga agagcggaat tatattcatc 1200 
gtgaccttcg ggctgccaac attctggtgt ctgacaccct gagctgcaag attgcagact 1260 
ttggcctagc acgcctcatt gaggacattc accatcaagt cagatgtgtg gtcttttggg 1320 
atcctgctga cggaaattgt cacccacggc cgcatccctt acccagggat gaccaacccg 1380 
gaggtgattc agaacctgga gcgaggctac cgcatggtgc gccctgacaa ctgtccagag 1440 
gagctgtacc aactcatgag gctgtgctgg aaggagcgcc cagaggaccg gcccaccttt 1500 
gactacctgc gcagtgtgct ggaggacttc ttcacggcca cagagggcca gtaccagcct 1560 
cagccttgag aggccttgag aggccctggg gttctccccc tttctctcca gcctgacttg 1620 
gggagatgga gttcttgtgc catagtcaca tggcctatgc acatatggac tctgcacatg 1680 
aatcccaccc acatgtgaca catatgcacc ttgtgtctgt acacgtgtcc tgtagttgcg 1740 
tggactctgc acatgtcttg tacatgtgta gcctgtgcat gtatgtcttg gacactgtac 1800 
aaggtacccc tttctggctc tcccatttcc tgagaccaca gagagagggg agaagcctgg 1860 
gattgacaga agcttctgcc cacctacttt tctttcctca gatcatccag aagttcctca 1920 
agggccagga ctttatctaa tacctctgtg tgctcctcct tggtgcctgg cctggcacac 1980 
atcaggagtt caataaatgt ctgttgatga ctgttgtaca caacaacaaa acaa 2034 



<210> 4 
<211> 2282 
<212> DNA 
<213> Humanus 



<400> 4 

ggagacaggt ggtggctacg acggcraagg gagctgagac tgtccaggca gccaggttag 60 
gccaggagga ccatgtgaat ggggccagag ggctcccggg ctgggcaggg accatgggct 120 
gtggctgcag ctcacacccg gaagatgact ggatggaaaa catcgatgtg tgtgagaact 180 
gccattatcc catagtccca ctggatggca agggcacgct gctcatccga aatggctctg 240 
aggtgcggga cccactggtt acctacgaag gctccaatcc gccggcttcc ccactgcaag 300 
gtgaccccag gcagcagggc ctgaaagaca aggcctgcgg atccctggct gttggcttcc 360 
acctctcccc cacctacttt ctccccggtc ttgccttcct tgtcccccac cctgtaactc 420 
caggcttcct gccgatccca gctcggttct ccctgacgcc ccttgtcttt acagacaacc 480 
tggttatcgc tctgcacagc tatgagccct ctcacgacgg agatctgggc tttgagaagg 540 
gggaacagct ccgcatcctg gagcagagcg gcgagtggtg gaaggcgcag tccctgacca 600 
cgggccagga aggcttcatc cccttcaatt ttgtggccaa agcgaacagc ctggagcccg 660 
aaccctggtt cttcaagaac ctgagccgca aggacgcgga gcggcagctc ctggcgcccg 720 
ggaacactca cggctccttc ctcatccggg agagcgagag caccgcggga tcgttttcac 780 
tgtcggtccg ggacttcgac cagaaccagg gagaggtggt gaaacattac aagatccgta 840 
atctggacaa cggtggcttc tacatctccc ctcgaatcac ttttcccggc ctgcatgaac 900 
tggtccgcca ttacaccaat gcttcagatg ggctgtgcac acggttgagc cgcccctgcc 960 
agacccagaa gccccagaag ccgtggtggg aggacgagtg ggaggttccc agggagacgc 1020 
tgaagctggt ggagcggctg ggggctggac agttcgggga ggtgtggatg gggtactaca 1080 
acgggcacac gaaggtggcg gtgaagagcc tgaagcaggg cagcatgtcc ccggacgcct 1140 
tcctggccga ggccaacctc atgaagcagc tgcaacacca gcggctggtt cggctctacg 1200 
ctgtggtcac ccaggagccc atctacatca tcactgaata catggagaat gggagtctag 1260 
tggattttct caagacccct tcaggcatca agttgaccat caacaaactc ctggacatgg 1320 
cagcccaaat tgcagaaggc atggcattca ttgaagagcg gaattatatt catcgtgacc 1380 
ttcgggctgc caacattctg gtgtctgaca ccctgagctg caagattgca gactttggcc 1440 
tagcacgcct cattgaggac aacgagtaca cagccaggga gggggccaag tttcccatta 1500 
agtggacagc gccagaagcc attaactacg ggacattcac catcaagtca gatgtgtggt 1560 
cttttgggat cctgctgacg gaaattgtca cccacggccg catcccttac ccagggatga 1620 
ccaacccgga ggtgattcag aacctggagc gaggctaccg catggtgcgc cctgacaact 1680 
gtccagagga gctgtaccaa ctcatgaggc tgtgctggaa ggagcgccca gaggaccggc 1740 
ccacctttga ctacctgcgc agtgtgctgg aggacttctt cacggccaca gagggccagt 1800 
accagcctca gccttgagag gccttgagag gccctggggt tctccccctt tctctccagc 1860 
ctgacttggg gagatggagt tcttgtgcca tagtcacatg gcctatgcac atatggactc 1920 
tgcacatgaa tcccacccac atgtgacaca tatgcacctt gtgtctgtac acgtgtcctg 1980 
tagttgcgtg gactctgcac atgtcttgta catgtgtagc ctgtgcatgt atgtcttgga 2040 
cactgtacaa ggtacccctt tctggctctc ccatttcctg agaccacaga gagaggggag 2100 
aagcctggga ttgacagaag cttctgccca cctacttttc tttcctcaga tcatccagaa 2160 
gttcctcaag ggccaggact ttatctaata cctctgtgtg ctcctccttg gtgcctggcc 2220 
tggcacacat caggagttca ataaatgtct gttgatgact gttgtacaca acaacaaaac 2280 
aa 2282 



<210> 5 
<211> 932 
<212> DNA 
<213> Humanus 



<400> 5 

gaggcagcag cagcaggagg aggcagagca cagcatcgtc gggaccagac tcgtctcagg 60 
ccagttgcag ccttctcagc caaacgccga ccaaggaaaa ctcactacca tgagaattgc 120 
agtgatttgc ttttgcctcc taggcatcac ctgtgccata ccagttaaac aggctgattc 180 
tggaagttct gaggaaaagc agctttacaa caaataccca gatgctgtgg ccacatggct 240 
aaaccctgac ccatctcaga agcagaatct cctagcccca cagaatgctg tgtcctctga 300 
agaaaccaat gactttaaac aagagaccct tccaagtaag tccaacgaaa gccatgacca 360 
catggatgat atggatgatg aagatgatga tgaccatgtg gacagccagg actccattga 420 
ctcgaacgac tctgatgatg tagatgacac tgatgattct caccagtctg atgagtctca 480 
ccattctgat gaatctgatg aactggtcac tgattttccc acggacctgc cagcaaccga 540 
agttttcact ccagttgtcc ccacagtaga cacatatgat ggccgaggtg atagtgtggt 600 
ttatggactg aggtcaaaat ctaagaagtt tcgcagacct gacatccagg taaatccttt 660 
aacagacaca cctgatggtt ctgactagcg ctcaagtcta ggaaaccaca gtttgcatat 720 
tcattcattc attcatccat tcattcatcc attcagcaag aattcattca tattctactt 780 
tatgaccatt gaatacaaat ctttttctgc ttggcggttt ttgtaagtct acataatttc 840 
tctctagatt tgattctcaa acacaattct actttttgaa atcctggatc agttattttc 900 
agattaaaat aaatggaaaa caccaattat tt 932 



<210> 6 
<211> 1314 
<212> DNA 
<213> Humanus 

<400> 6 

agcgtcggac taccgttggt ttccgcaact tcctggatta tcctcgccaa ggactttgca 60 
atatattttt ccgccttttc tggaaggatt tcgctgcttc ccgaagtctt ggacgagcgc 120 
tctagctctg tgggaaggtt ttgggctctc tggctcggat tttgcaattt ctccctgggg 180 
actgccgtgg agccgcatcc actgtggatt ataattgcaa catgacgctg gaagagctcg 240 
tggcgtgcga caacgcggcg cagaagatgc agacggtgac cgccgcggtg gaggagcttt 300 
tggtggccgc tcagcgccag gatcgcctca cagtgggggt gtacgagtcg gccaagttga 360 
tgaatgtgtg agtcagaccc ccttcccggg ctgggcgcgg gtgggacggg acctcccctc 420 
cgctctggac gctttccgca cgcttgtctt gcatggagct gggacttccc caagtgcccc 480 
ccgctgtgga tgcagagctt ctctgccgtt ttgtggatcg ggggctgccg tatcctgatg 540 
tatcgtctgc aaacacccct cccgccgtgg gcctgtctcc ccctacccca tactttgaac 600 
cgtgtgccct cccctccccc caccgtcacc agcttgcaga ggcaatcccc tgcacccttg 660 
cagtttcgaa ccctcacacg gacgcctgga agagccacgg cttggtggag gtggccagct 720 
actgcgaaga aagccggggc aacaaccagt gggtccccta catctctctt caggaacgct 780 
gaggcccttc ccagcagcag aatctgttga gttgctgcca caaacaaaaa atacaataaa 840 
tatttgaacc ccctcccccc cagcacaacc cccccaaaac aacccaaccc acgaggacca 900 
tcgggggcag agtcgttgga gactgaagag gaagaggagg aggagaaggg gagtgagcgg 960 
ccgctgccgc cttccccatc acggagggtc cagactgtcc actcgggggt ggagtgagac 1020 
tgactgcaag ccccaccctc cttgagactg gagctggcgt ctgcatacga gagacttggt 1080 
tgaacttggt tggtccttgt ctgcaccctc gacaagacca cactttggga cttgggagct 1140 
ggggctgaag ttgctctgta cccatgaact cccagtttgc gaattataga gacaatctat 1200 
tttgttactt gcacttgtta ttcgaaccac tgagagcgag atgggaagca tagatatcta 1260 
tatttttatt tctactatga gggccttgta ataaatttct aaagcctcaa aaaa 1314 



<210> 7 
<211> 2130 
<212> DNA 
<213> Humanus 

<400> 7 

cgctgtgtct gcccggtctc tcctggggtc tccggttctg ccgcctcctg tccctccttc 60 

cggctgcccc ggccagaggg gctcgaaggc gcgttggggc ccggggccac gctgggagac 120 

acgccttcgg ccgcgcctaa ttcgagccag gcgcgggacg cgtcccccgg ccgggcggag 180 

cgggagggga cgcccgccgc gctcggcctc cccgggcccc gacccctcct ttgtaatttg 240 

aataaaacgc ctcccccgcc cgcgcgccgc cttaacccgc cgcctccgct ctccccggct 300 

gcaggcggcg tgcaggacca gcggcggccg tgcaggcgga ggacttcggc gcggctcctc 360 

ctgggtgtga ccccgggcgc gcccgccgcg cgacgatgag ggcgcggccg caggtctgcg 420 

aggcgctgct cttcgccctg gcgctccaga ccggcgtgtg ctatggcatc aagtggctgg 480 

cgctgtccaa gacaccatcg gccctggcac tgaaccagac gcaacactgc aagcagctgg 540 

agggtctggt gtctgcacag gtgcagctgt gccgcagcaa cctggagctc atgcacacgg 600 

tggtgcacgc cgcccgcgag gtcatgaagg cctgtcgccg ggcctttgcc gacatgcgct 660 

ggaactgctc ctccattgag ctcgccccca actatttgct tgacctggag agagggaccc 720 

gggagtcggc cttcgtgtat gcgctgtcgg ccgccgccat cagccacgcc atcgcccggg 780 

cctgcacctc cggcgacctg cccggctgct cctgcggccc cgtcccaggt gagccacccg 840 

ggcccgggaa ccgctgggga agatgtgcgg acaacctcag ctacgggctc ctcatggggg 900 

ccaagttttc cgatgctcct atgaaggtga aaaaaacagg atcccaagcc aataaactga 960 

tgcgtctaca caacagtgaa gtggggagac aggctctgcg cgcctctctg gaaatgaagt 1020 

gtaagtgcca tggggtgtct ggctcctgct ccatccgcac ctgctggaag gggctgcagg 1080 

agctgcagga tgtggctgct gacctcaaga cccgatacct gtcggccacc aaggtagtgc 1140 

accgacccat gggcacccgc aagcacctgg tgcccaagga cctggatatc cggcctgtga 1200 

aggactcgga actcgtctat ctgcagagct cacctgactt ctgcatgaag aatgagaagg 1260 

tgggctccca cgggacacaa gacaggcagt gcaacaagac atccaacgga agcgacagct 1320 

gcgaccttat gtgctgctac gtcacctgcc gcaggtgtga gcgtaccgtg gagcgctatg 1380 

tctgcaagtg aggccctgcc ctccgcccca cgcaggagcg aggactctgc tcaaggaccc 1440 

tcagcaactg gggccagggg cctggagaca ctccatggag ctctgcttgt gaattccaga 1500 

tgccaggcat gggaggcggc ttgtgctttg ccttcacttg gaagccacca ggaacagaag 1560 

gtctggccac cctggaagga gggcaggaca tcaaaggaaa ccgacaagat taaaaataac 1620 

ttggcagcct gaggctctgg agtgcccaca ggctggtgta aggagcgggg cttgggatcg 1680 

gtgagactga tacagacttg acctttcagg gccacagaga ccagcctccg ggaaggggtc 1740 

tgcccgcctt cttcagaatg ttctgcggga ccccctggcc caccctgggg tctgagcctg 1800 

ctgggcccac cacatggaat cactagcttg ggttgtaaat gttttctttt gttttttgct 1860 

ttttcttcct ttgggatgtg gaagctacag aaatatttat aaaacatagc tttttctttg 1920 

gggtggcact tctcaattcc tctttatata ttttatatat ataaatatat atgtatatat 1980 

ataatgatct ctattttaaa actagctttt taagcagctg tatgaaataa atgctgagtg 2040 
agccccagcc cgcccctgca gttcccggcc tcgtcaagtg aactcggcag accctggggc 2100 
tggcagaggg agctctccag tttccgggca 2130 



<210> 8 
<211> 1847 
<212> DNA 
<213> Humanus 

<400> 8 

cgctgtgtct gcccggtctc tcctggggtc tccggttctg ccgcctcctg tccctccttc 60 

cggctgcccc ggccagaggg gctcgaaggc gcgttggggc ccggggccac gctgggagac 120 

acgccttcgg ccgcgcctaa ttcgagccag gcgcgggacg cgtcccccgg ccgggcggag 180 

cgggagggga cgcccgccgc gctcggcctc cccgggcccc gacccctcct ttgtaatttg 240 

aataaaacgc ctcccccgcc cgcgcgccgc cttaacccgc cgcctccgct ctccccggct 300 



gcaggcggcg tgcaggacca gcggcggccg tgcaggcgga ggacttcggc gcggctcctc 360
ctgggtgtga ccccgggcgc gcccgccgcg cgacgatgag ggcgcggccg caggtctgcg 420
aggcgctgct cttcgccctg gcgctccaga ccggcgtgtg ctatggcatc aagtggctgg 480
cgctgtccaa gacaccatcg gccctggcac tgaaccagac gcaacactgc aagcagctgg 540
agggtctggt gtctgcacag gtgcagctgt gccgcagcaa cctggagctc atgcacacgg 600
tggtgcacgc cgcccgcgag gtcatgaagg cctgtcgccg ggcctttgcc gacatgcgct 660
ggaactgctc ctccattgag ctcgccccca actatttgct tgacctggag agagggaccc 720
gggagtcggc cttcgtgtat gcgctgctga cctcaagacc cgatacctgt cggccaccaa 780
ggtagtgcac cgacccatgg gcacccgcaa gcacctggtg cccaaggacc tggatatccg 840
gcctgtgaag gactcggaac tcgtctatct gcagagctca cctgacttct gcatgaagaa 900
tgagaaggtg ggctcccacg ggacacaaga caggcagtgc aacaagacat ccaacggaag 960
cgacagctgc gaccttatgt gctgcgggcg tggctacaac ccctacacag accgcgtggt 1020
cgagcggtgc cactgtaagt accactggtg ctgctacgtc acctgccgca ggtgtgagcg 1080
taccgtggag cgctatgtct gcaagtgagg ccctgccctc cgccccacgc aggagcgagg 1140
actctgctca aggaccctca gcaactgggg ccaggggcct ggagacactc catggagctc 1200
tgcttgtgaa ttccagatgc caggcatggg aggcggcttg tgctttgcct tcacttggaa 1260
gccaccagga acagaaggtc tggccaccct ggaaggaggg caggacatca aaggaaaccg 1320
acaagattaa aaataacttg gcagcctgag gctctggagt gcccacaggc tggtgtaagg 1380
agcggggctt gggatcggtg agactgatac agacttgacc tttcagggcc acagagacca 1440
gcctccggga aggggtctgc ccgccttctt cagaatgttc tgcgggaccc cctggcccac 1500
cctggggtct gagcctgctg ggcccaccac atggaatcac tagcttgggt tgtaaatgtt 1560
ttcttttgtt ttttgctttt tcttcctttg ggatgtggaa gctacagaaa tatttataaa 1620
acatagcttt ttctttgggg tggcacttct caattcctct ttatatattt tatatatata 1680
aatatatatg tatatatata atgatctcta ttttaaaact agctttttaa gcagctgtat 1740
gaaataaatg ctgagtgagc cccagcccgc ccctgcagtt cccggcctcg tcaagtgaac 1800
tcggcagacc ctggggctgg cagagggagc tctccagttt ccgggca 1847



<210> 9 
<211> 2369 
<212> DNA 
<213> Humanus 



<400> 9 

gaattccgcc agccccgcca gtccccgcgc agtccccgcg cagtcccagc gccaccgggc 60
agcagcggcg ccgtgctcgc tccagggcgc aaccatgtcg ccatttcttc ggattggctt 120
gtccaacttt gactgcgggt cctgccagtc ttgtcagggc gaggctgtta acccttactg 180
tgctgtgctc gtcaaagagt atgtcgaatc agagaacggg cagatgtata tccagaaaaa 240
gcctaccatg tacccaccct gggacagcac ttttgatgcc catatcaaca agggaagagt 300
catgcagatc attgtgaaag gcaaaaacgt ggacctcatc tctgaaacca ccgtggagct 360
ctactcgctg gctgagaggt gcaggaagaa caacgggaag acagaaatat ggttagagct 420
gaaacctcaa ggccgaatgc taatgaatgc aagatacttt ctggaaatga gtgacacaaa 480
ggacatgaat gaatttgaga cggaaggctt ctttgctttg catcagcgcc ggggtgccat 540
caagcaggca aaggtccacc acgtcaagtg ccacgagttc actgccacct tcttcccaca 600
gcccacattt tgctctgtct gccacgagtt tgtctggggc ctgaacaaac agggctacca 660
gtgccgacaa tgcaatgcag caattcacaa gaagtgtatt gataaagtta tagcaaagtg 720
cacaggatca gctatcaata gccgagaaac catgttccac aaggagagat tcaaaattga 780
catgccacac agatttaaag tctacaatta caagagcccg accttctgtg aacactgtgg 840
gaccctgctg tggggactgg cacggcaagg actcaagtgt gatgcatgtg gcatgaatgt 900
gcatcataga tgccagacaa aggtggccaa cctttgtggc ataaaccaga agctaatggc 960
tgaagcgctg gccatgattg agagcactca acaggctcgc tgcttaagag atactgaaca 1020
gatcttcaga gaaggtccgg ttgaaattgg tctcccatgc tccatcaaaa atgaagcaag 1080
gccgccatgt ttaccgacac cgggaaaaag agagcctcag ggcatttcct gggagtctcc 1140
gttggatgag gtggataaaa tgtgccatct tccagaacct gaactgaaca aagaaagacc 1200
atctctgcag attaaactaa aaattgagga ttttatcttg cacaaaatgt tggggaaagg 1260
aagttttggc aaggtcttcc tggcagaatt caagaaaacc aatcaatttt tcgcaataaa 1320
ggccttaaag aaagatgtgg tcttgatgga cgatgatgtt gagtgcacga tggtagagaa 1380
gagagttctt tccttggcct gggagcatcc gtttctgacg cacatgtttt gtacattcca 1440
gaccaaggaa aacctctttt ttgtgatgga gtacctcaac ggaggggact taatgtacca 1500
catccaaagc tgccacaagt tcgacctttc cagagcgacg ttttatgctg ctgaaatcat 1560
tcttggtctg cagttccttc attccaaagg aatagtctac agggacctga agctagataa 1620
catcctgtta gacaaagatg gacatatcaa gatcgcggat tttggaatgt gcaaggagaa 1680
catgttagga gatgccaaga cgaatacctt ctgtgggaca cctgactaca tcgccccaga 1740
gatcttgctg ggtcagaaat acaaccactc tgtggactgg tggtccttcg gggttctcct 1800
ttatgaaatg ctgattggtc agtcgccttt ccacgggcag gatgaggagg agctcttcca 1860
ctccatccgc atggacaatc ccttttaccc acggtggctg gagaaggaag caaaggacct 1920
tctggtgaag gtaagaagct gaaggtaaga agcgaagcca agagcgtctt cataagacga 1980
gcattaggtc ttctggtcag ttttctgttc ctcttagttt ccaacttgca tgtggcaaac 2040
aatgattatt attgaactgg gtttaaatgg gatgtgcacc gtctgtgttt taatagaggc 2100
accaatatta tgagcattaa atgtcaaaat gagtgtaaga gaaaccctca tgtgcatcag 2160
ttataacata acggccccag gaaccagttc catggacctt gaatacgctc acctggagat 2220
gtagttggtt cattaaacaa gcacagtgtg tggcttaaaa atcaatcttc tagctacttg 2280
ggaggttgag gcaggaggat tgaagttgga agtgtgaggt cagcctgggc aatgtatcga 2340
gacccctgtc tccaaaacaa taaagggga 2369



<210> 10 
<211> 1101 
<212> DNA 
<213> Humanus 



<400> 10 

ctccttagtc gaggcaagac gtgcgcccga gccccgccga accgaggcca cccggagccg 60
tgcccagtcc acgccggccg tgcccggcgg ccttaagaac caggcaacct ctgccttctt 120
ccctcttcca ctcggagtcg cgctccgcgc gccctcactg cagcccctgc gtcgccggga 180
ccctcgcgcg cgaccagccg aatcgctcct gcagcagagc caacatgccc atcactcgga 240
tgcgcatgag accctggcta gagatgcaga ttaattccaa ccaaatcccg gggctcatct 300
ggattaataa agaggagatg atcttccaga tcccatggaa gcatgctgcc aagcatggct 360
gggacatcaa caaggatgcc tgtttgttcc ggagctgggc cattcacaca ggccgataca 420
aagcagggga aaaggagcca gatcccaaga cgtggaaggc caactttcgc tgtgccatga 480
actccctgcc agatatcgag gaggtgaaag accagagcag gaacaagggc agctcagctg 540
tgcgagtgta ccggatgctt ccacctctca ccaagaacca gagaaaagaa agaaagtcga 600
agtccagccg agatgctaag agcaaggcca agaggaagtc atgtggggat tccagccctg 660
ataccttctc tgatggactc agcagctcca ctctgcctga tgaccacagc agctacacag 720
ttccaggcta catgcaggac ttggaggtgg agcaggccct gactccagca ctgtcgccat 780
gtgctgtcag cagcactctc cccgactggc acatcccagt ggaagttgtg ccggacagca 840
ccagtgatct gtacaacttc caggtgtcac ccatgccctc cacctctgaa gctacaacag 900
atgaggatga ggaagggaaa ttacctgagg acatcatgaa gctcttggag cagtcggagt 960
ggcagccaac aaacgtggat gggaaggggt acctactcaa tgaacctgga gtccagccca 1020
cctctgtcta tggagacttt agctgtaagg aggagccaga aattgacagc ccagggggta 1080
agaaggcccc tggatcctta t 1101



<210> 11 
<211> 4628 
<212> DNA 
<213> Humanus 



<400> 11 

cctcttgcgg ccacaggcgc ggcgtcctcg gcggcgggcg gcagctagcg ggagccggga 60
cgccggtgca gccgcagcgc gcggaggaac ccgggtgtgc cgggagctgg gcggccacgt 120
ccggwcggga ccgagacccc tcgtagcgca ttgcggcgac ctcgccttcc ccggccgcga 180
gcgcgccgct gcttgaaaag ccgcggaacc caaggacttt tctccggtcc gagctcgggg 240
cgccccgcag gcgcacggta cccgtgctgc agtcgggcac gccgcgggcg ccgggggcct 300
ccgcagggcg atggagcccg gtctgcaagg aaagtgaggc gccgccgctg cgttctggag 360
gaggggggca caaggtctgg agaccccggg tggcggacgg gagccctccc cccgccccgc 420
ctccggggca ccagctccgg ctccattgtt cccgcccggg ctggaggcgc cgagcaccga 480
gcgccgccgg gagtcgagcg ccggccgcgg agctcttgcg accccgccag gacccgaaca 540
gagcccgggg gcggcgggcc ggagccgggg acgcgggcac acgcccgctc gcacaagcca 600
cggcggactc tcccgaggcg gaacctccac gccgagcgag ggtcagtttg aaaaggagga 660
tcgagctcac tgtggagtat ccatggagat gtggagcctt gtcaccaacc tctaactgca 720
gaactgggat gtggagctgg aagtgcctcc tcttctgggc tgtgctggtc acagccacac 780
tctgcaccgc taggccgtcc ccgaccttgc ctgaacaagc ccagccctgg ggagcccctg 840
tggaagtgga gtccttcctg gtccaccccg gtgacctgct gcagcttcgc tgtcggctgc 900 
gggacgatgt gcagagcatc aactggctgc gggacggggt gcagctggcg gaaagcaacc 960
gcacccgcat cacaggggag gaggtggagg tgcaggactc cgtgcccgca gactccggcc 1020 
tctatgcttg cgtaaccagc agcccctcgg gcagtgacac cacctacttc tccgtcaatg 1080 
tttcagatgc tctcccctcc tcggaggatg atgatgatga tgatgactcc tcttcagagg 1140 
agaaagaaac agataacacc aaaccaaacc gtatgcccgt agctccatat tggacatccc 1200 
cagaaaagat ggaaaagaaa ttgcatgcag tgccggctgc caagacagtg aagttcaaat 1260
gcccttccag tgggacccca aaccccacac tgcgctggtt gaaaaatggc aaagaattca 1320 
aacctgacca cagaattgga ggctacaagg tccgttatgc cacctggagc atcataatgg 1380 
actctgtggt gccctctgac aagggcaact acacctgcat tgtggagaat gagtacggca 1440 
gcatcaacca cacataccag ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc 1500 
aagcagggtt gcccgccaac aaaacagtgg ccctgggtag caacgtggag ttcatgtgta 1560 
aggtgtacag tgacccgcag ccgcacatcc agtggctaaa gcacatcgag gtgaatggga 1620 
gcaagattgg cccagacaac ctgccttatg tccagatctt gaagactgct ggagttaata 1680 
ccaccgacaa agagatggag gtgcttcact taagaaatgt ctcctttgag gacgcagggg 1740 
agtatacgtg cttggcgggt aactctatcg gactctccca tcactctgca tggttgaccg 1800 
ttctggaagc cctggaagag aggccggcag tgatgacctc gcccctgtac ctggagatca 1860
tcatctattg cacaggggcc ttcctcatct cctgcatggt ggggtcggtc atcgtctaca 1920 
agatgaagag tggtaccaag aagagtgact tccacagcca gatggctgtg cacaagctgg 1980
ccaagagcat ccctctgcgc agacaggtaa cagtgtctgc tgactccagt gcatccatga 2040 
actctggggt tcttctggtt cggccatcac ggctctcctc cagtgggact cccatgctag 2100 
caggggtctc tgagtatgag cttcccgaag accctcgctg ggagctgcct cgggacagac 2160 
tggtcttagg caaacccctg ggagagggct gctttgggca ggtggtgttg gcagaggcta 2220 
tcgggctgga caaggacaaa cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt 2280 
cggacgcaac agagaaagac ttgtcagacc tgatctcaga aatggagatg atgaagatga 2340 
tcgggaagca taagaatatc atcaacctgc tgggggcctg cacgcaggat ggtcccttgt 2400 
atgtcatcgt ggagtatgcc tccaagggca acctgcggga gtacctgcag gcccggaggc 2460
ccccagggct ggaatactgc tacaacccca gccacaaccc agaggagcag ctctcctcca 2520 
aggacctggt gtcctgcgcc taccaggtgg cccgaggcat ggagtatctg gcctccaaga 2580 
agtgcataca ccgagacctg gcagccagga atgtcctggt gacagaggac aatgtgatga 2640 
agatagcaga ctttggcctc gcacgggaca ttcaccacat cgactactat aaaaagacaa 2700 
ccaacggccg actgcctgtg aagtggatgg cacccgaggc attatttgac cggatctaca 2760 
cccaccagag tgatgtgtgg tctttcgggg tgccccatac cccggtgtgc ctgtggagga 2820 
acttttcaag ctgctgaagg agggtcaccg catggacaag cccagtaact gcaccaacga 2880 
gctgtacatg atgatgcggg actgctggca tgcagtgccc tcacagagac ccaccttcaa 2940 
gcagctggtg gaagacctgg accgcatcgt ggccttgacc tccaaccagg agtacctgga 3000 
cctgtccatg cccctggacc agtactcccc cagctttccc gacacccgga gctctacgtg 3060 
ctcctcaggg gaggattccg tcttctctca tgagccgctg cccgaggagc cctgcctgcc 3120 
ccgacaccca gcccagcttg ccaatggcgg actcaaacgc cgctgactgc cacccacacg 3180
ccctccccag actccaccgt cagctgtaac cctcacccac agcccctgct gggcccacca 3240 
cctgtccgtc cctgtcccct ttcctgctgg caggagccgg ctgcctacca ggggccttcc 3300 
tgtgtggcct gccttcaccc cactcagctc acctctccct ccacctcctc tccacctgct 3360 
ggtgagaggt gcaaagaggc agatctttgc tgccagccac ttcatcccct cccagatgtt 3420 
ggaccaacac ccctccctgc caccaggcac tgcctggagg gcagggagtg ggagccaatg 3480 
aacaggcatg caagtgagag cttcctgagc tttctcctgt cggtttggtc tgttttgcct 3540 
tcacccataa gcccctcgca ctctggtggc aggtgccttg tcctcagggc tacagcagta 3600 
gggaggtcag tgcttcgtgc ctcgattgaa ggtgacctct gccccagata ggtggtgcca 3660 
gtggcctatt aattccgata ctagtttgct ttgctgacca aatgcctggt accagaggat 3720
ggtgaggcga aggccaggtt gggggcagtg ttgtggccct ggggcccagc cccaaactgg 3780 
gggctctgta tatagctatg aagaaaacac aaagtgtata aatctgagta tatatttaca 3840 
tgtcttttta aaagggtcgt taccagagat ttacccatcg ggtaagatgc tcctggtggc 3900 
tgggaggcat cagttgctat atattaaaaa caaaaaagaa aaaaaaggaa aatgttttta 3960 
aaaaggtcat atattttttg ctacttttgc tgttttattt ttttaaatta tgttctaaac 4020 
ctattttcag tttaggtccc tcaataaaaa ttgctgctgc ttcatttatc tatgggctgt 4080 
atgaaaaggg tgggaatgtc cactggaaag aagggacacc cacgggccct ggggctaggt 4140 
ctgtcccgag ggcaccgcat gctcccggcg caggttcctt gtaacctctt cttcctaggt 4200 
cctgcaccca gacctcacga cgcacctcct gcctctccgc tgcttttgga aagtcagaaa 4260 
aagaagatgt ctgcttcgag ggcaggaacc ccatccatgc agtagaggcg ctgggcagag 4320 
agtcaaggcc cagcagccat cgaccatgga tggtttcctc caagggaaac cggtggggtt 4380 
tgggctgggg agggggcacc tacctaggga ttagccacgg ggtagagcta cagtgattaa 4440 
gaggaaagca agggcgtgag ccaccacgcc tggcattaat ttgtcctttt ttaagatatg 4500
gggtctcact atgttgccca ggctggtctt gaactcctgg gctcaagtga tcctcctgcc 4560
ttagcctccc aaagtgctgg gattacaggc gtgagccact gtgcccagcc tgatgtgttc 4620
ttaattta 4628 



<210> 12
<211> 1418 
<212> DNA 
<213> Humanus 

<400> 12 

ccacgtgggg gctcagcgtg cacccttctt tgtgctcggg ttaggaggag ctaggctgcc 60 
atcgggccgg tgcagatacg gggttgctct tttgctcata agaggggctt cgctggcagt 120 
ctgaacggca agcttgagtc aggaccctta attaagatcc tcaattggct ggagggcaga 180 
tctcgcgagt agggcaacgc ggtaaaaata ttgcttcggt gggtgacgcg gtacagctgc 240 
ccaagggcgt tcgtaacggg aatgccgaag cgtgggaaaa agggagcggt ggcggaagac 300 
ggggatgagc tcaggacaga gccagaggcc aagaagagta agacggccgc aaagaaaaat 360 
gacaaagagg cagcaggaga gggcccagcc ctgtatgagg accccccaga tcagaaaacc 420 
tcacccagtg gcaaacctgc cacactcaag atctgctctt ggaatgtgga tgggcttcga 480 
gcctggatta agaagaaagg attagattgg gtaaaggaag aagccccaga tatactgtgc 540 
cttcaagaga ccaaatgttc agagaacaaa ctaccagctg aacttcagga gctgcctgga 600 
ctctctcatc aatactggtc agctccttcg gacaaggaag ggtacagtgg cgtgggcctg 660 
ctttcccgcc agtgcccact caaagtttct tacggcatag catatgtacc taatgcaggc 720 
cgaggtctgg tacgactgga gtaccggcag cgctgggatg aagcctttcg caagttcctg 780 
aagggcctgg cttcccgaaa gccccttgtg ctgtgtggag acctcaatgt ggcacatgaa 840 
gaaattgacc ttcgcaaccc caaggggaac aaaaagaatg ctggcttcac gccacaagag 900 
cgccaaggct tcggggaatt actgcaggct gtgccactgg ctgacagctt taggcacctc 960 
taccccaaca caccctatgc ctacaccttt tggacttata tgatgaatgc tcgatccaag 1020 
aatgttggtt ggcgccttga ttactttttg ttgtcccact ctctgttacc tgcattgtgt 1080 
gacagcaaga tccgttccaa ggccctcggc agtgatcact gtcctatcac cctataccta 1140 
gcactgtgac accaccccta aatcactttg agcctgggaa ataagccccc tcaactacca 1200
ttccttcttt aaacactctt cagagaaatc tgcattctat ttctcatgta taaaactagg 1260 
aatcctccaa ccaggctcct gtgatagagt tcttttaagc ccaagatttt ttatttgagg 1320 
gttttttgtt ttttaaaaaa aaattgaaca aagactacta atgactttgt ttgaattatc 1380 
cacatgaaaa taaagagcca tagtttcagc gtcgtacc 1418 



<210> 13 
<211> 1558 
<212> DNA 
<213> Humanus 

<400> 13 

ccacgtgggg gctcagcgtg cacccttctt tgtgctcggg ttaggaggag ctaggctgcc 60 
atcgggccgg tgcagatacg gggttgctct tttgctcata agaggggctt cgctggcagt 120 
ctgaacggca agcttgagtc aggaccctta attaagatcc tcaattggct ggagggcaga 180
tctcgcgagt agggcaacgc ggtaaaaata ttgcttcggt gggtgacgcg gtacagctgc 240
ccaagggcgt tcgtaacggg aatgccgaag cgtgggaaaa agggagcggt ggcggaagac 300 
ggggatgagc tcaggacagg taagggaatg aaatcagccc ttcttcctag aactgcggcg 360 
ggggtgtttg tcattccctt gatgtacgag agccagaggc caagaagagt aagacggccg 420 
caaagaaaaa tgacaaagag gcagcaggag agggcccagc cctgtatgag gaccccccag 480 
atcagaaaac ctcacccagt ggcaaacctg ccacactcaa gatctgctct tggaatgtgg 540 
atgggcttcg agcctggatt aagaagaaag gattagattg ggtaaaggaa gaagccccag 600 
atatactgtg ccttcaagag accaaatgtt cagagaacaa actaccagct gaacttcagg 660 
agctgcctgg actctctcat caatactggt cagctccttc ggacaaggaa gggtacagtg 720 
gcgtgggcct gctttcccgc cagtgcccac tcaaagtttc ttacggcata ggcgatgagg 780 
agcatgatca ggaaggccgg gtgattgtgg ctgaatttga ctcgtttgtg ctggtaacag 840 
catatgtacc taatgcaggc cgaggtctgg tacgactgga gtaccggcag cgctgggatg 900 
aagcctttcg caagttcctg aagggcctgg cttcccgaaa gccccttgtg ctgtgtggag 960 
acctcaatgt ggcacatgaa gaaattgacc ttcgcaaccc caaggggaac aaaaagaatg 1020 
ctggcttcac gccacaagag cgccaaggct tcggggaatt actgcaggct gtgccactgg 1080
ctgacagctt taggcacctc taccccaaca caccctatgc ctacaccttt tggacttata 1140 
tgatgaatgc tcgatccaag aatgttggtt ggcgccttga ttactttttg ttgtcccact 1200 
ctctgttacc tgcattgtgt gacagcaaga tccgttccaa ggccctcggc agtgatcact 1260 
gtcctatcac cctataccta gcactgtgac accaccccta aatcactttg agcctgggaa 1320 
ataagccccc tcaactacca ttccttcttt aaacactctt cagagaaatc tgcattctat 1380 
ttctcatgta taaaactagg aatcctccaa ccaggctcct gtgatagagt tcttttaagc 1440 
ccaagatttt ttatttgagg gttttttgtt ttttaaaaaa aaattgaaca aagactacta 1500 
atgactttgt ttgaattatc cacatgaaaa taaagagcca tagtttcagc gtcgtacc 1558 



<210> 14 
<211> 1911 
<212> DNA 
<213> Humanus 

<400> 14 

tgccgccgtc ccgcccgcca gcgccccagc gaggaagcag cgcgcagccc gcggcccagc 60 

gcacccgcag cagcgcccgc agctcgtccg cgccatgttc caggcggccg agcgccccca 120 

ggagtgggcc atggagggcc cccgcgacgg gctgaagaag gagcggctac tggacgaccg 180 

ccacgacagc ggcctggact ccatgaaaga cgaggagtac gagcagatgg tcaaggagct 240 

gcaggagatc cgcctcgagc cgcaggaggt gccgcgcggc tcggagccct ggaagcagca 300 

gctcaccgag gacggggact cgttcctgca cttggccatc atccatgaag aaaaggcact 360 

gaccatggaa gtgatccgcc aggtgaaggg agacctggcc ttcctcaact tccagaacaa 420 

cctgcagcag actccactcc acttggctgt gatcaccaac cagccagaaa ttgctgaggc 480

acttctggga gctggctgtg atcctgagct ccgagacttt cgaggaaata cccccctaca 540 

ccttgcctgt gagcagggct gcctggccag cgtgggagtc ctgactcagt cctgcaccac 600 

cccgcacctc cactccatcc tgaaggctac caactacaat ggccacacgt gtctacactt 660 

agcctctatc catggctacc tgggcatcgt ggagcttttg gtgtccttgg gtgctgatgt 720 

caatgctcag gagccctgta atggccggac tgcccttcac ctcgcagtgg acctgcaaaa 780 

tcctgacctg gtgtcactcc tgttgaagtg tggggctgat gtcaacagag ttacctacca 840 

gggctattct ccctaccagc tcacctgggg ccgcccaagc acccggatac agcagcagct 900 

gggccagctg acactagaaa accttcagat gctgccagag agtgaggatg aggagagcta 960 

tgacacagag tcagagttca cggagttcac agaggacgag gtgagtctgt gaactcttcg 1020 

gcgctctaac taatgaggtg ccattccctt caccctccca ggcccctaga gctgctcctt 1080 

atcagagggg tatctacata atgagtctct caaatttctg tgcataacca gtatcccaag 1140 

aatgtacctg ccccccccct ttttttaata gcttactctt tttttaaaga gaagcattaa 1200 

aggcaaactc cctcagcctg taaagttcat tatatttggg ctatggagaa tggagtccaa 1260 

gagttatttc cagtagtggc ctccccatcc cggtagcttg gcagagctcc gctccggaag 1320 

cttaacgtgt cttttttccc cttgttttca gctgccctat gatgactgtg tgtttggagg 1380 

ccagcgtctg acgttatgag cgcaaagggg ctgaaagaac atggacttgt atatttgtac 1440 

aaaaaaaaag ttttattttt ctaaaaaaag aaaaaagaag aaaaaattta aagggtgtac 1500 

ttatatccac actgcacact gcctggccca aaacgtctta ttgtggtagg atcagccctc 1560 

attttgttgc ttttgtgaac tttttgtagg ggacgagaaa gatcattgaa attctgagaa 1620 

aacttctttt aaacctcacc tttgtggggt ttttggagaa ggttatcaaa aatttcatgg 1680 

aaggaccaca ttttatattt attgtgcttc gagtgactga ccccagtggt atcctgtgac 1740

atgtaacagc caggagtgtt aagcgttcag tgatgtgggg tgaaaagtta ctacctgtca 1800 

aggtttgtgt taccctcctg taaatggtgt acataatgta ttgttggtaa ttattttggt 1860 

acttttatga tgtatattta ttaaacagat ttttacaaat ggaanaaaga n 1911 



<210> 15 
<211> 1476 
<212> DNA 
<213> Humanus 

<400> 15 

tgccgccgtc ccgcccgcca gcgccccagc gaggaagcag cgcgcagccc gcggcccagc 60
gcacccgcag cagcgcccgc agctcgtccg cgccatgttc caggcggccg agcgccccca 120
ggagtgggcc atggagggcc cccgcgacgg gctgaagaag gagcggctac tggacgaccg 180
ccacgacagc ggcctggact ccatgaaaga cgaggagtac gagcagatgg tcaaggagct 240
gcaggagatc cgcctcgagc cgcaggaggt gccgcgcggc tcggagccct ggaagcagca 300
gctcaccgag gacggggact cgttcctgca cttggccatc atccatgaag aaaaggcact 360
gaccatggaa gtgatccgcc aggtgaaggg agacctggcc ttcctcaact tccagaacaa 420
cctgcagcag actccactcc acttggctgt gatcaccaac cagccagaaa ttgctgaggc 480
acttctggga gctggctgtg atcctgagct ccgagacttt cgaggaaata cccccctaca 540
ccttgcctgt gagcagggct gcctggccag cgtgggagtc ctgactcagt cctgcaccac 600
cccgcacctc cactccatcc tgaaggctac caactacaat ggtcaggagc cctgtaatgg 660
ccggactgcc cttcacctcg cagtggacct gcaaaatcct gacctggtgt cactcctgtt 720
gaagtgtggg gctgatgtca acagagttac ctaccagggc tattctccct accagctcac 780
ctggggccgc ccaagcaccc ggatacagca gcagctgggc cagctgacac tagaaaacct 840
tcagatgctg ccagagagtg aggatgagga gagctatgac acagagtcag agttcacgga 900
gttcacagag gacgagctgc cctatgatga ctgtgtgttt ggaggccagc gtctgacgtt 960
atgagcgcaa aggggctgaa agaacatgga cttgtatatt tgtacaaaaa aaaagtttta 1020
tttttctaaa aaaagaaaaa agaagaaaaa atttaaaggg tgtacttata tccacactgc 1080
acactgcctg gcccaaaacg tcttattgtg gtaggatcag ccctcatttt gttgcttttg 1140
tgaacttttt gtaggggacg agaaagatca ttgaaattct gagaaaactt cttttaaacc 1200
tcacctttgt ggggtttttg gagaaggtta tcaaaaattt catggaagga ccacatttta 1260
tatttattgt gcttcgagtg actgacccca gtggtatcct gtgacatgta acagccagga 1320
gtgttaagcg ttcagtgatg tggggtgaaa agttactacc tgtcaaggtt tgtgttaccc 1380
tcctgtaaat ggtgtacata atgtattgtt ggtaattatt ttggtacttt tatgatgtat 1440
atttattaaa cagattttta caaatggaan aaagan 1476



<210> 16 
<211> 2913 
<212> DNA 
<213> Humanus 



<400> 16 

aagcggcagg agcagcgttg gcaccggcga accatggctg ggattttcta tttcgcccta 60
ttttcgtgtc tcttcgggat ttgcgacgct gtcacaggtt ccagggtata ccccgcgaat 120
gaagttacct tattggattc cagatctgtt cagggagaac ttgggtggat agcaagccct 180
ctggaaggag ggtgggagga agtgagtatc atggatgaaa aaaatacacc aatccgaacc 240
taccaagtgt gcaatgtgat ggaacccagc cagaataact ggctacgaac tgattggatc 300
acccgagaag gggctcagag ggtgtatatt gagattaaat tcaccttgag ggactgcaat 360
agtcttccgg gcgtcatggg gacttgcaag gagacgttta acctgtacta ctatgaatca 420
gacaacgaca aagagcgttt catcagagag aaccagtttg tcaaaattga caccattgct 480
gctgatgaga gcttcaccca agtggacatt ggtgacagaa tcatgaagct gaacaccgag 540
atccgggatg tagggccatt aagcaaaaag gggttttacc tggcttttca ggatgtgggg 600
gcctgcatcg ccctggtatc agtccgtgtg ttctataaaa agtgtccact cacagtccgc 660
aatctggccc agtttcctga caccatcaca ggggctgata cgtcttccct ggtggaagtt 720
cgaggctcct gtgtcaacaa ctcagaagag aaagatgtgc caaaaatgta ctgtggggca 780
gatggtgaat ggctggtacc cattggcaac tgcctatgca acgctgggca tgaggagcgg 840
agcggagaat gccaagcttg caaaattgga tattacaagg ctctctccac ggatgccacc 900
tgtgccaagt gcccacccca cagctactct gtctgggaag gagccacctc gtgcacctgt 960
gaccgaggct ttttcagagc tgacaacgat gctgcctcta tgccctgcac ccgtccacca 1020
tctgctcccc tgaacttgat ttcaaatgtc aacgagacat ctgtgaactt ggaatggagt 1080
agccctcaga atacaggtgg ccgccaggac atttcctata atgtggtatg caagaaatgt 1140
ggagctggtg accccagcaa gtgccgaccc tgtggaagtg gggtccacta caccccacag 1200
cagaatggct tgaagaccac caaagtctcc atcactgacc tcctagctca taccaattac 1260
acctttgaaa tctgggctgt gaatggagtg tccaaatata accctaaccc agaccaatca 1320
gtttctgtca ctgtgaccac caaccaagca gcaccatcat ccattgcttt ggtccaggct 1380
aaagaagtca caagatacag tgtggcactg gcttggctgg aaccagatcg gcccaatggg 1440
gtaatcctgg aatatgaagt caagtattat gagaaggatc agaatgagcg aagctatcgt 1500
atagttcgga cagctgccag gaacacagat atcaaaggcc tgaaccctct cacttcctat 1560
gttttccacg tgcgagccag gacagcagct ggctatggag acttcagtga gcccttggag 1620
gttacaacca acacagtgcc ttcccggatc attggagatg gggctaactc cacagtcctt 1680
ctggtctctg tctcgggcag tgtggtgctg gtggtaattc tcattgcagc ttttgtcatc 1740
agccggagac ggagtaaata cagtaaagcc aaacaagaag cggatgaaga gaaacatttg 1800
aatcaaggtg taagaacata tgtggacccc tttacgtacg aagatcccaa ccaagcagtg 1860
cgagagtttg ccaaagaaat tgacgcatcc tgcattaaga ttgaaaaagt tataggagtt 1920
ggtgaatttg gtgaggtatg cagtgggcgt ctcaaagtgc ctggcaagag agagatctgt 1980

gtggctatca agactctgaa agctggttat acagacaaac agaggagaga cttcctgagt 2040 

gaggccagca tcatgggaca gtttgaccat ccgaacatca ttcacttgga aggcgtggtc 2100 

actaaatgta aaccagtaat gatcataaca gagtacatgg agaatggctc cttggatgca 2160 

ttcctcagga aaaatgatgg cagatttaca gtcattcagc tggtgggcat gcttcgtggc 2220 

attgggtctg ggatgaagta tttatctgat atgagctatg tgcatcgtga tctggccgca 2280 

cggaacatcc tggtgaacag caacttggtc tgcaaagtgt ctgattttgg catgtcccga 2340 

gtgcttgagg atgatccgga agcagcttac accaccaggg gtggcaagat tcctatccgg 2400

tggactgcgc cagaagcaat tgcctatcgt aaattcacat cagcaagtga tgtatggagc 2460 

tatggaatcg ttatgtggga agtgatgtcg tacggggaga ggccctattg ggatatgtcc 2520 

aatcaagata cctaacactg ccttgttgga tccaagctcc cctgaattct ctgctgtggt 2580 

atcagtgggc gattggctcc aggccattaa aatggaccgg tataaggata acttcacagc 2640 

tgctggttat accacactag aggctgtggt gcacgtgaac caggaggacc tggcaagaat 2700 

tggtatcaca gccatcacgc accagaataa gattttgagc agtgtccagg caatgcgaac 2760 

ccaaatgcag cagatgcacg gcagaatggt tcccgtctga gccagtactg aataaactca 2820 

aaactcttga aattagttta cctcatccat gcactttaat tgaagaactg cacttttttt 2880 

acttcgtctt cgccctctga aattaaagaa atg 2913



<210> 17 
<211> 3695 
<212> DNA 
<213> Humanus 

<400> 17 

ccgtttcctc ccctcccctc cactcggccg tccctccttc ctcctccctc ctccctcctc 60 
ctcccgctcc tgaagagcgc gccgcgtggg ggacggcccg gttacttcct ccagagactg 120 
acgagtgcgg tgtcgctcca gctcagagct cccggagccg cccggccagc gtccggcctc 180 
cctgatcgtc tctggccggc gccctcgccc tcgcccggcg cgcaccgagc agccgcgggc 240 
gccgagcagc caccgtcccg accaagcgcc ggccctgccc gcagcggcag gatgaatgat 300 
ttcggaatca agaatatgga ccaggtagcc cctgtggcta acagttacag agggacactc 360 
aagcgccagc cagcctttga cacctttgat gggtccctgt ttgctgtttt tccttctcta 420 
aatgaagagc aaacactgca agaagtgcca acaggcttgg attccatttc tcatgactcc 480 
gccaactgtg aattgccttt gttaaccccg tgcagcaagg ctgtgatgag tcaagcctta 540 
aaagctacct tcagtggctt ctctgggcca ccaatgagtt cagtctggtg aacgtgaatc 600 
tgcagaggtt cggcatgaat ggccagatgc tgtgtaacct tggcaaggaa cgctttctgg 660 
agctggcacc tgactttgtg ggtgacattc tctgggaaca tctggagcaa atgatcaaag 720 
aaaaccaaga aaagacagaa gatcaatatg aagaaaattc acacctcacc tccgttcctc 780 
attggattaa cagcaataca ttaggttttg gcacagagca ggcgccctat ggaatgcaga 840 
cacagaatta ccccaaaggc ggcctcctgg acagcatgtg tccggcctcc acacccagcg 900 
tactcagctc tgagcaggag tttcagatgt tccccaagtc tcggctcagc tccgtcagcg 960 
tcacctactg ctctgtcagt caggacttcc caggcagcaa cttgaatttg ctcaccaaca 1020 
attctgggac gcccaaagac cacgactccc ctgagaacgg tgcggacagc ttcgagagct 1080 
cagactccct cctccagtcc tggaacagcc agtcgtcctt gctggatgtg caacgggttc 1140 
cttccttcga gagcttcgaa gatgactgca gccagtctct ctgcctcaat aagccaacca 1200 
tgtctttcaa ggattacatc caagagagga gtgacccggt ggagcaaggc aaaccagtta 1260 
tacctgcagc tgtgctggcc ggcttcacag gaagtggacc tattcagctg tggcagtttc 1320 
tcctggagct gctatcagac aaatcctgcc agtcattcat cagctggact ggagacggat 1380 
gggagtttaa gctcgccgac cccgatgagg tggcccgccg gtggggaaag aggaaaaata 1440 
agcccaagat gaactacgag aagctgagcc ggggcttacg ctactattac gacaagaaca 1500 
tcatccacaa gacgtcgggg aagcgctacg tgtaccgctt cgtgtgcgac ctccagaact 1560 
tgctggggtt cacgcccgag gaactgcacg ccatcctggg cgtccagccc gacacggagg 1620 
actgaggtcg ccgggaccac cctgagccgg ccccaggctc gtggactgag tgggaagccc 1680 
atcctgacca gctgctccga ggacccagga aaggcaggat tgaaaatgtc caggaaagtg 1740 
gccaagaagc agtggcctta ttgcatccca aaccacgcct cttgaccagg ctgcctccct 1800 
tgtggcagca acggcacagc taattctact cacagtgctt ttaagtgaaa atggtcgaga 1860 
aagaggcacc aggaagccgt cctggcgcct ggcagtccgt gggacgggat ggttctggct 1920 
gtttgagatt ctcaaaggag cgagcatgtc gtggacacac acagactatt tttagatttt 1980 
cttttgcctt ttgcaaccag gaacagcaaa tgcaaaaact ctttgagagg gtaggagggt 2040 
gggaaggaaa caaccatgtc atttcagaag ttagtttgta tatattataa taatcttata 2100 
attgttctca gaatccctta acagttgtat ttaacagaaa ttgtatattg taatttaaaa 2160 
taattatata actgtatttg aaataagaat tcagacatct gaggttttat ttcatttttc 2220
aatagcacat atggaatttt gcaaagattt aatctgccaa gggccgacta agagaagttg 2280
taaagtatgt attatttaca tttaatagac ttacagggat aaggcctgtg gggggtaatc 2340
cctgcttttt gtgttttttt gtttgtttgt ttgtttgttt ttggggggtt ttcttgcctt 2400
ggttgtctgg caaggacttt gtacatttgg gagtttttat gagaaactta aatgttatta 2460
tctgggctta tatctggcct ctgctttctc ctttaattgt aaagtaaaag ctataaagca 2520
gtatttttct tgacaaatgg catatgtttt ccacttcttt gcatgcgttt aagtcagttt 2580
atacacaaaa tggattttat tttttagttt aactgtgttt ctccgacagc tcacctctct 2640
ctgaccaccc agccatttcc ttcctgtgct ccacgttctt ctgtgtgatt aaaataagaa 2700
tattattttt ggaaatatgc aactcctttt cagagatcag gagggattta tgtagcagct 2760
atttttactg caaaagtaat tcactgggaa aaaaaatgta atttgtaaga aagctttatt 2820
tttatctcag ctctatgtaa agttaaagtt actgtacaga gctgaaggac ggggggcggt 2880
aggggtcttg atgaaacctc ttgaacgaag cacagtttgt cccatctttg ttcactcgtg 2940
tgtctcaacc atcttaatag catgctgctc ctttttgctc agtgtccaca gcaagatgac 3000
gtgattctta ttttcttgga cacagactat tctgaggcac agagcgggga cttaagatgg 3060
gaaagagaaa gcatcggagc cattcattcg gagaaaacgt tttgatcaaa atggagactt 3120
ttgtagtcgt ttcaaaagag cacctgagtc atgtgtattc ccggccttta taaatgaccc 3180
ggtcaagttg gtttcaaagt ccgacaggct tgtctgttta ctagctgcgt ggccttggac 3240
gggtggctga catctgtaaa gaatcctcct gtgatgaaac tgaggaatcg ggtggccggg 3300
caagctggga agagcaaagc cagagctgcg ctgcctcaat acccacaaaa gaccattccc 3360
agtatacata agcacaggat gtttttctca agagggatgt atttatcact tggacatctg 3420
tttataatat aaacagacat gtgactggga acatcttgct gccaaaagaa tcctaggcag 3480
tggctcattg tatgtgaggt tgaaccacgt gaaattgcca atattaggct ggcttttatc 3540
tacaaagaag gagtttcatg gggttcagcc taacagttat ggaaactaca gtccttataa 3600
accattggca tggtaataaa cagatcttaa gtataaaaat tttgtaattg ggcctttact 3660
ctctcaataa taaagtattt tgtttatata aaaaa 3695



<210> 18 
<211> 1535 
<212> DNA 
<213> Humanus 



<400> 18 

attaattctg gctccacttg ttgctcggcc caggttgggg agaggacgga gggtggccgc 60
agcgggttcc tgagtgaatt acccaggagg gactgagcac agcaccaact agagaggggt 120
cagggggtgc gggactcgag cgagcaggaa ggaggcagcg cctggcacca gggctttgac 180
tcaacagaat tgagacacgt ttgtaatcgc tggcgtgccc cgcgcacagg atcccagcga 240
aaatcagatt tcctggtgag gttgcgtggg tggattaatt tggaaaaaga aactgcctat 300
atcttgccat caaaaaactc acggaggaga agcgcagtca atcaacagta aacttaagag 360
acccccgatg ctcccctggt ttaacttgta tgcttgaaaa ttatctgaga gggaataaac 420
atcttttcct tcttccctct ccagaagtcc attggaatat taagcccagg agttgctttg 480
gggatggctg gaagtgcaat gtcttccaag ttcttcctag tggctttggc catatttttc 540
tccttcgccc aggttgtaat tgaagccaat tcttggtggt cgctaggtat gaataaccct 600
gttcagatgt cagaagtata tattatagga gcacagcctc tctgcagcca actggcagga 660
ctttctcaag gacagaagaa actgtgccac ttgtatcagg accacatgca gtacatcgga 720
gaaggcgcga agacaggcat caaagaatgc cagtatcaat tccgacatcg acggtggaac 780
tgcagcactg tggataacac ctctgttttt ggcagggtga tgcagatagg cagccgcgag 840
acggccttca catacgccgt gagcgcagca ggggtggtga acgccatgag ccgggcgtgc 900
cgcgagggcg agctgtccac ctgcggctgc agccgcgccg cgcgccccaa ggacctgccg 960
cgggactggc tctggggcgg ctgcggcgac aacatcgact atggctaccg ctttgccaag 1020
gagttcgtgg acgcccgcga gcgggagcgc atccacgcca agggctccta cgagagtgct 1080
cgcatcctca tgaacctgca caacaacgag gccggccgca ggacggtgta caacctggct 1140
gatgtggcct gcaagtgcca tggggtgtcc ggctcatgta gcctgaagac atgctggctg 1200
cagctggcag acttccgcaa ggtgggtgat gccctgaagg agaagtacga cacactggtt 1260
taaaaaacaa acatcgaaag gcttatgcca aatggaagat agaatataaa ataaaacgtt 1320
acttgtatat tggtaagtgg tttcaattgt ccttcagata attcatgtgg agatttttgg 1380
agaaaccatg acggatagtt taggatgact acatgtcaaa gtaataaaag agtggtgaat 1440
tttaccaaaa ccaagctatt tggaagcttc aaaaggtttc tatatgtaat ggaacaaaag 1500
gggaattctc ttttcctata tatgttcctt acaaa 1535
<210> 19
<211> 3877 
<212> DNA 
<213> Humanus 



<400> 19 

cattagatct ttacatgaaa gtaaaattta taagatttct agaaagtcaa aagatgataa 60
ctatttctta ggatactaaa agcactcaca ttatagaaaa aaaatcagtt aactatactc 120
cacaaacatt aaaggctccc tataaaaaaa catttttaat aggcaagcca cagaaagggc 180
aaatattaat agtttgcaat acatatgtat gaaaaggaat tgaatctaga atatttaaca 240
aagctttaca actcaaaaaa tacaaagaaa atatttttct tccaattggc aaattactta 300
aacagaacct tcacaaaaga agataagaat gtttaataaa catttgaagc cataataatg 360
acatcattag ccatgatgga aatgcaaatt taagtaccac ttcacatcca caagaaaaag 420
ataaaaataa aaggactgag ctcaccaaac attggtgagg atgtggtaat actgaaattc 480
ttgtaccgtg ctcctgaggg tataacatat tacaggattt ttttgaaaac tagtggttcc 540
ttataaactt aatgccctgg caacctcaca cctatttact taagaatgaa agggccccgc 600
cctcctccct cctcgctcgc gggccgggcc cggcatggtg cggcgtcgcc gccgatggcg 660
ctgaggcgga gcatggggcg gccggggctc ccgccgctgc cgctgccgcc gccaccgcgg 720
ctcgggctgc tgctggcgga gtccgccgcc gcaggtctga agctcatggg agccccggtg 780
aagctgacag tgtctcaggg gcagccggtg aagctcaact gcagtgtgga ggggatggag 840
gagcctgaca tccagtgggt gaaggatggg gctgtggtcc agaacttgga ccagttgtac 900
atcccagtca gcgagcagca ctggatcggc ttcctcagcc tgaagtcagt ggagcgctct 960
gacgccggcc ggtactggtg ccaggtggag gatgggggtg aaaccgagat ctcccagcca 1020
gtgtggctca cggtagaagg tgtgccattt ttcacagtgg agccaaaaga tctggcagtg 1080
ccacccaatg cccctttcca actgtcttgt gaggctgtgg gtccccctga acctgttacc 1140
attgtctggt ggagaggaac tacgaagatc gggggacccg ctccctctcc atctgtttta 1200
aatgtaacag gggtgaccca gagcaccatg ttttcctgtg aagctcacaa cctaaaaggc 1260
ctggcctctt ctcgcacagc cactgttcac cttcaagcac tgcctgcagc ccccttcaac 1320
atcaccgtga caaagctttc cagcagcaac gctagtgtgg cctggatgcc aggtgctgat 1380
ggccgagctc tgctacagtc ctgtacagtt caggtgacac aggccccagg aggctgggaa 1440
gtcctggctg ttgtggtccc tgtgcccccc tttacctgcc tgctccggga cctggtgcct 1500
gccaccaact acagcctcag ggtgcgctgt gccaatgcct tggggccctc tccctatgct 1560
gactgggtgc cctttcagac caagggtcta gccccagcca gcgctcccca aaacctccat 1620
gccatccgca cagattcagg cctcatcttg gagtgggaag aagtgatccc cgaggcccct 1680
ttggaaggcc ccctgggacc ctacaaactg tcctgggttc aagacaatgg aacccaggat 1740
gagctgacag tggaggggac cagggccaat ttgacaggct gggatcccca aaaggacctg 1800
atcgtacgtg tgtgcgtctc caatgcagtt ggctgtggac cctggagtca gccactggtg 1860
gtctcttctc atgaccgtgc aggccagcag ggccctcctc acagccgcac atcctgggta 1920
cctgtggtcc ttggtgtgct aacggccctg gtgacggctg ctgccctggc cctcatcctg 1980
cttcgaaaga gacggaaaga gacgcggttt gggcaagcct ttgacagtgt catggcccgg 2040
ggagagccag ccgttcactt ccgggcagcc cggtccttca atcgagaaag gcccgagcgc 2100
atcgaggcca cattggacag cttgggcatc agcgatgaac taaaggaaaa actggaggat 2160
gtgctcatcc cagagcagca gttcaccctg ggccggatgt tgggcaaagg agagtttggt 2220
tcagtgcggg aggcccagct gaagcaagag gatggctcct ttgtgaaagt ggctgtgaag 2280
atgctgaaag ctgacatcat tgcctcaagc gacattgaag agttcctcag ggaagcagct 2340
tgcatgaagg agtttgacca tccacacgtg gccaaacttg ttggggtaag cctccggagc 2400
agggctaaag gccgtctccc catccccatg gtcatcttgc ccttcatgaa gcatggggac 2460
ctgcatgcct tcctgctcgc ctcccggatt ggggagaacc cctttaacct acccctccag 2520
accctgatcc ggttcatggt ggacattgcc tgcggcatgg agtacctgag ctctcggaac 2580
ttcatccacc gagacctggc tgctcggaat tgcatgctgg cagaggacat gacagtgtgt 2640
gtggctgact tcggactctc ccggaagatc tacagtgact gtcggtacat actcaccccc 2700
ggagggctgg ctgagcagcc agggcaggca gagcaccagc cagagagtcc cctcaatgag 2760
acacagaggc ttttgctgct gcagcaaggg ctactgccac a r cagtagctg ttagcccaca 2820
ggcagagggc atcggggcca tttggccggc tctggtggcc actgagctgg ctgactaagc 2880
cccgtctgac cccagcccag acagcaaggt gtggaggctc ctgtggtagt cctcccaagc 2940
tgtgctggga agcccggact gaccaaatca cccaatccca gttcttcctg caaccactct 3000
gtggccagcc tggcatcagt ttaggccttg gcttgatgga agtgggccag tcctggttgt 3060
ctgaacccag gcagctggca ggagtggggt ggttatgttt ccatggttac catgggtgtg 3120
gatggcagtg tggggagggc aggtccagct ctgtgggccc taccctcctg ctgagctgcc 3180
cctgctgctt aagtgcatgc attgagctgc ctccagcctg gtggcccagc tattaccaca 3240
cttggggttt aaatatccag gtgtgcccct ccaagtcaca aagagatgtc cttgtaatat 3300
tcccttttag gtgagggttg gtaaggggtt ggtatctcag gtctgaatct tcaccatctt 3360
tctgattccg caccctgcct acgccaggag aagttgaggg gagcatgctt ccctgcagct 3420
gaccgggtca cacaaaggca tgctggagta cccagcctat caggtgcccc tcttccaaag 3480
gcagcgtgcc gagccagcaa gaggaagggg tgctgtgagg cttgcccagg agcaagtgag 3540
gccggagagg agttcaggaa cccttctcca tacccacaat ctgagcacgc taccaaatct 3600
caaaatatcc taagactaac aaaggcagct gtgtctgagc ccaacccttc taaacggtga 3660
cctttagtgc caacttcccc tctaactgga cagcctcttc tgtcccaagt ctccagagag 3720
aaatcaggcc tgatgagggg gaattcctgg aacctggacc ccagccttgg tgggggagcc 3780
tctggaatgc atggggcggg tcctagctgt tagggacatt tccaagctgt tagttgctgt 3840
ttaaaataga aataaaattg aagactaaag acctaaa 3877



<210> 20 
<211> 3532 
<212> DNA 
<213> Humanus 



<400> 20 

ttcccctccc ctccccaccc ccttttttcg ctgctctccc gcttccccgc caatggagaa 60
gcgagctgat gacaaatagc gggccgcgga gtccgcggga cttcgcacca ggagtaataa 120
aacagaccca gagatcaagg agctggggag ggggcggggg aacagggagg gagagcgtgt 180
gagcgtctgc gagtgtgtgg aggcggctgc tgtggcagcg caggcggctc ggctccggcc 240
cggagcgcag cggaagccgc gagggatgca gcggcgggga ccttggccgg tggaggatgt 300
ggaggtggaa gtggagcgga tggcgctccc caagagctcc gccacgcgag gtttcgggct 360
cgtggttttg cttcctccgg ccaaggtccc cgccgctgca tccctcgcgg cttcccgtgc 420
gctccgggcc ggagccgagc cgcctgcgct gccacagcca gcgcctccac acactcgcag 480
acgctcacac gctctccctc cctgttcccc cgccccctcc ccagctcctt gatctctggg 540
tctgttttat tactcctggt gcgagtcccg cggactccgg cccgctattt gtcatcagct 600
cgctctccat tggcggggag cggagagcag cgaagaaggg ggtggggagg ggaggggaag 660
ggaagggggt ggaaactgcc tggagccgtt tctcgcgccg ctgttggtgc tgccgctgcc 720
tcctcctcct ccgccgccgc cgccgccgcc gccgccgcct cctccggctc ttcgctcggc 780
ccctctccgc ctccatgtgc cggatagcgg gagcgctgcg gaccctgctg ccgctgctgg 840
cggccctgct tcaggcgtct gtagaggctt ctggtgaaat cgcattatgc aagactggat 900
ttcctgaaga tgtttacagt gcagtcttat cgaaggatgt gcatgaagga cagcctcttc 960
tcaatgtgaa gtttagcaac tgcaatggaa aaagaaaagt acaatatgag agcagtgagc 1020
ctgcagattt taaggtggat gaagatggca tggtgtatgc cgtgagaagc tttccactct 1080
cttctgagca tgccaagttc ctgatatatg cccaagacaa agagacccag gaaaagtggc 1140
aagtggcagt aaaattgagc ctgaagccaa ccttaactga ggagtcagtg aaggagtcag 1200
cagaagttga agaaatagtg ttcccaagac aattcagtaa gcacagtggc cacctacaaa 1260
ggcagaagag agactgggtc atccctccaa tcaacttgcc agaaaactcc aggggacctt 1320
ttcctcaaga gcttgtcagg atcaggtctg atagagataa aaacctttca ctgcggtaca 1380
gtgtaactgg gccaggagct gaccagcctc caactggtat cttcattatc aaccccatct 1440
cgggtcagct gtcggtgaca aagcccctgg atcgcgagca gatagcccgg tttcatttga 1500
gggcacatgc agtagatatt aatggaaatc aagtggagaa ccccattgac attgtcatca 1560
atgttattga catgaatgac aacagacctg agttcttaca ccaggtttgg aatgggacag 1620
ttcctgaggg atcaaagcct ggaacatatg tgatgaccgt aacagcaatt gatgctgacg 1680
atcccaatgc cctcaatggg atgttgaggt acagaatcgt gtctcaggct ccaagcaccc 1740
cttcacccaa catgtttaca atcaacaatg agactggtga catcatcaca gtggcagctg 1800
gacttgatcg agaaaaagtg caacagtata cgttaataat tcaagctaca gacatggaag 1860
gcaatcccac atatggcctt tcaaacacag ccacggccgt catcacagtg acagatgtca 1920
atgacaatcc tccagagttt actgccatga cgttttatgg tgaagttcct gagaacaggg 1980
tagacatcat agtagctaat ctaactgtga ccgataagga tcaaccccat acaccagcct 2040
ggaacgcagt gtacagaatc agtggcggag atcctactgg acggttcgcc atccagaccg 2100
acccaaacag caacgacggg ttagtcaccg tggtcaaacc aatcgacttt gaaacaaata 2160
ggatgtttgt ccttactgtt gctgcagaaa atcaagtgcc attagccaag ggaattcagc 2220
acccgcctca gtcaactgca accgtgtctg ttacagttat tgacgtaaat gaaaaccctt 2280
attttgcccc caatcctaag atcattcgcc aagaagaagg gcttcatgcc ggtaccatgt 2340
tgacaacatt cactgctcag gacccagatc gatatatgca gcaaaatatt agatacacta 2400
aattatctga tcctgccaat tggctaaaaa tagatcctgt gaatggacaa ataactacaa 2460
ttgctgtttt ggaccgagaa tcaccaaatg tgaaaaacaa tatatataat gctactttcc 2520
2520 
ttgcttctga caatggaatt cctcctatga gtggaacagg aacgctgcag atctatttac 2580
ttgatattaa tgacaatgcc cctcaagtgt tacctcaaga ggcagagact tgcgaaactc 2640
cagaccccaa ttcaattaat attacagcac ttgattatga cattgatcca aatgctggac 2700
catttgcttt tgatcttcct ttatctccag tgactattaa gagaaattgg accatcactc 2760
ggcttaatgg tgattttgct cagcttaatt taaagataaa atttcttgaa gctggtatct 2820
atgaagttcc catcataatc acagattcgg gtaatcctcc caaatcaaat atttccatcc 2880
tgcgcgtgaa ggtttgccag tgtgactcca acggggactg cacagatgtg gacaggattg 2940
tgggtgcggg gcttggcacc ggtgccatca ttgccatcct gctctgcatc atcatcctgc 3000
ttatccttgt gctgatgttt gtggtatgga tgaaacgccg ggataaagaa cgccaggcca 3060
aacaactttt aattgatcca gaagatgatg taagagataa tattttaaaa tatgatgaag 3120
aaggtggagg agaagaagac caggactatg acttgagcca gctgcagcag cctgacactg 3180
tggagcctga tgccatcaag cctgtgggaa tccgacgaat ggatgaaaga cccatccacg 3240
ccgagcccca gtatccggtc cgatctgcag ccccacaccc tggagacatt ggggacttca 3300
ttaatgagaa aacctggccc atacagagct tacacctatg accttggctt cgtgggcacc 3360
atgatttcag caatgcatct atcatgcctg cctttggacc taatgagtat gaaccacatt 3420
acatcagaga agagtgccag ggtcaacaat taatatttta gagttacaac tacatgtgaa 3480
cctatgtact tgcattttca gcaatattgc agcatagtat tatttatctc ta 3532



<210> 21 
<211> 3069 
<212> DNA
<213> Humanus

<400> 21



agattatgat cgcctgaggc ccctctccta cccagatacc gatgttatac tgatgtgttt 60 
ttcctttttt tttttttttt tttaagtaat taagggtagt taaattattt aaagtataca 120 
aagtccaaac agccaggggt aaggtctcca agaggccttc ccagggtaag ggagtgcgga 180 
gaggccccgg tcgccacccg cggtgcccat ggagcgggtg aagatgatca acgtgcagcg 240 
tctgctggag gctgccgagt ttttggagcg ccgggagcga gagtgtgaac atggctacgc 300 
ctcttcattc ccgtccatgc cgagcccccg actgcagcat tcaaagcccc cacggaggtt 360 
gagccgggca cagaaacaca gcagcgggag cagcaacacc agcactgcca acagatctac 420 
acacaatgag ctggaaaaga atcgcttgaa gaattggttg gttgggagaa gggataccag 480 
aggcatgaag atgcttttga aggctattgc agtaatctag acgagctcat ctgcgccttt 540 
gtttagaacg cttaaaagtt ctgattccac taggaccaga ctgcacccgg cacacaacac 600 
ttggtttgct caacaaagcc aaagcacaca tcaagaaact tgaagaagct gaaagaaaaa 660 
gccagcacca gctcgagaat ttggaacgag aacagagatt tttaaagtgg cgactggaac 720 
agctgcaggg tcctcaggag atggaacgaa tacgaatgga cagcattgga tcaactattt 780 
cttcagatcg ttctgattca gagcgagagg agattgaagt ggatgttgaa agcacagagt 840 
tctcccatgg agaagtggac aatataagta ccaccagcat cagtgacatt gatgaccaca 900 
gcagcctgcc gagtattggg agtgacgagg gttactccag tgccagtgtc aaactttcat 960 
tcacttcata gaacccagca tgacataaca gtgcagggca aaatattcac tgggccaatt 1020 
caatacaaac aatctcttaa attgggttca tgatgcagtc tcctctttaa aacaaaacaa 1080 
aacaaaacaa aactatactt gaacaaaagg gtcagaggac ctgtatttaa gcaaatactt 1140 
agcaaaaagt ggggcagagc tcccaaggag aacaaatatt cagaatattc atattggaaa 1200 
aatcacaatt tttaatggca gcagaaaact tgtgtgaaat tttcttgatt tgagttgatt 1260 
gagaagagga cattggagat gccatcctct ttctcttttc tagtttgctc atactacatt 1320 
gagtagacac atttaaggat ggggttatga acccttcctg agctttatgg tcctaaaagc 1380 
aaaataaaaa ctattcgaat gaaaagacaa gaaaatcagg tattaatctt ggatagctaa 1440
taatgagcta ttaaaactca gcctgggaca gtttatcatg aagcctgtgg atgatcaatc 1500
ctttattatt attttttttt ttttgaaaaa agctcatttc atgctctgca aaaggagaga 1560
ctcccatgaa gccttttgaa agggatcatc atgcagctca actttctgtt ggattccatg 1620
ctaagcaagc taaccttatc ctgcattgtt agcactaggc acccagctgc cacctctcca 1680
tcctgctgcc cttaggccac atgggagcag tccatgcatg acagcctcta tcctacaagg 1740
cctatgagta tggattgggg gggccaaaag gaaaaagctc catgtgcctc tttgtctgcg 1800
tgggtcagaa gagttgtgca cgcagattag caggccaagg tctgagccac agcagcattt 1860
ttatttcaga ttttgataac tgtttatatg tgttgaaaac caaaatgaca tctttttaaa 1920
gcttatccat aaaaaaaaat agatgtcttt tatagtggaa aaacacatgg ggaaaaaaat 1980
catctatttt gatgcagcat ttgataatga taaaacacct cacacctcac tctttatagt 2040
gcacaaaatg aatgaggtct gggctaggta gaaaaagggt caatgctatt tttgttttta 2100
gaatcattac cttttaccag cttttaacca tctgatatct atagtagaca cactatcata 2160 






gttaacatag ttaagttcag cacttgtctc attttaatgt aaagatttgc ttccattttc 2220
ctacaggcag tctctctctt cctcacagtc ccactgtgca ggtgctattg ttactcttac 2280
gaatattttc agtaatgtta ttttcttcta agtgaaattt ctagcctgca ctttgatgtc 2340
atgtgttccc tttgtctttc aaactccaag gttcccctgt ggccctctcc cttaccctgg 2400
gaaggcctct tggagacctt acccctggct gtttggactt tgtatacttt aaataattta 2460
actaccctta attacttaaa aaaaaaaaaa aaagctttat gattttcata acttattgct 2520
gattttaatg gattgttaat ttcagtcctg tagttttatt ttatgtttag atagggctgg 2580
gcaaggaaaa agaaaataaa gacaaccata tttagcagtg cagttgagtt gtgtgttaat 2640
gttagactat ccctttgtga gtgacacttt aacagcattc actgcttcta tatatagtgt 2700
accatcttgg tcatacatta cgcctcaaca tatacttgtg ctcttccttt gcctccagaa 2760
gaagtttttc cttgattgtg ctatgtttca gtggaagaaa ttctttgaag tagatgtgag 2820
tgaaaaactg catgccttta gaagcccagt atcagaactt gctacgtttc aggtgctagg 2880
gacttaatga aaaacaggac aaaacaattc ctttttgtgg cccaggtaaa ttatttctgg 2940
tttcacttat aattactaat ggctgagtca agatgttgtc tctgtgtttg cttactcttg 3000
atcaagtgtg agacagtttg aagactgtgc taccatacaa agtgaatgaa gccagtgact 3060
aagaaacaa 3069



<210> 22 
<211> 2971 
<212> DNA 
<213> Humanus 



<400> 22 

cggccgcgca aggaggcgcg ctgcgagggc gcggggctgg cccccgccgc gcccccggct 60
gtgccccccg ccgtggccgc gccccagccc ccggccctgc ccgaggaccc cgctggggcc 120
aagcccaggt gccccttctc agacattttc aacaccagcg agaactcgat ggagaagcac 180
atcaacactt ttctgcagaa cgtgcagatt ctgctcgagg ccgccagcta cctggagcag 240
atcgagaaag aaaacaaaaa gtgtgaacat ggctacgcct cttcattccc gtccatgccg 300
agcccccgac tgcagcattc aaagccccca cggaggttga gccgggcaca gaaacacagc 360
agcgggagca gcaacaccag cactgccaac agatctacac acaatgagct ggaaaagaat 420
cgacgagctc atctgcgcct ttgtttagaa cgcttaaaag ttctgattcc actaggacca 480
gactgcaccc ggcacacaac acttggtttg ctcaacaaag ccaaagcaca catcaagaaa 540
cttgaagaag ctgaaagaaa aagccagcac cagctcgaga atttggaacg agaacagaga 600
tttttaaagt ggcgactgga acagctgcag ggtcctcagg agatggaacg aatacgaatg 660
gacagcattg gatcaactat ttcttcagat cgttctgatt cagagcgaga ggagattgaa 720
gtggatgttg aaagcacaga gttctcccat ggagaagtgg acaatataag taccaccagc 780
atcagtgaca ttgatgacca cagcagcctg ccgagtattg ggagtgacga gggttactcc 840
agtgccagtg tcaaactttc attcacttca tagaacccag catgacataa cagtgcaggg 900
caaaatattc actgggccaa ttcaatacaa acaatctctt aaattgggtt catgatgcag 960
tctcctcttt aaaacaaaac aaaacaaaac aaaactatac ttgaacaaaa gggtcagagg 1020
acctgtattt aagcaaatac ttagcaaaaa gtggggcaga gctcccaagg agaacaaata 1080
ttcagaatat tcatattgga aaaatcacaa tttttaatgg cagcagaaaa cttgtgtgaa 1140
attttcttga tttgagttga ttgagaagag gacattggag atgccatcct ctttctcttt 1200
tctagtttgc tcatactaca ttgagtagac acatttaagg atggggttat gaacccttcc 1260
tgagctttat ggtcctaaaa gcaaaataaa aactattcga atgaaaagac aagaaaatca 1320
ggtattaatc ttggatagct aataatgagc tattaaaact cagcctggga cagtttatca 1380
tgaagcctgt ggatgatcaa tcctttatta ttattttttt ttttttgaaa aaagctcatt 1440
tcatgctctg caaaaggaga gactcccatg aagccttttg aaagggatca tcatgcagct 1500
caactttctg ttggattcca tgctaagcaa gctaacctta tcctgcattg ttagcactag 1560
gcacccagct gccacctctc catcctgctg cccttaggcc acatgggagc agtccatgca 1620
tgacagcctc tatcctacaa ggcctatgag tatggattgg gggggccaaa aggaaaaagc 1680
tccatgtgcc tctttgtctg cgtgggtcag aagagttgtg cacgcagatt agcaggccaa 1740
ggtctgagcc acagcagcat ttttatttca gattttgata actgtttata tgtgttgaaa 1800
accaaaatga catcttttta aagcttatcc ataaaaaaaa atagatgtct tttatagtgg 1860
aaaaacacat ggggaaaaaa atcatctatt ttgatgcagc atttgataat gataaaacac 1920
ctcacacctc actctttata gtgcacaaaa tgaatgaggt ctgggctagg tagaaaaagg 1980
gtcaatgcta tttttgtttt tagaatcatt accttttacc agcttttaac catctgatat 2040
ctatagtaga cacactatca tagttaacat agttaagttc agcacttgtc tcattttaat 2100
gtaaagattt gcttccattt tcctacaggc agtctctctc ttcctcacag tcccactgtg 2160
caggtgctat tgttactctt acgaatattt tcagtaatgt tattttcttc taagtgaaat 2220
ttctagcctg cactttgatg tcatgtgttc cctttgtctt tcaaactcca aggttcccct 2280
gtggccctct cccttaccct gggaaggcct cttggagacc ttacccctgg ctgtttggac 2340
tttgtatact ttaaataatt taactaccct taattactta aaaaaaaaaa aaaaagcttt 2400
atgattttca taacttattg ctgattttaa tggattgtta atttcagtcc tgtagtttta 2460
ttttatgttt agatagggct gggcaaggaa aaagaaaata aagacaacca tatttagcag 2520
tgcagttgag ttgtgtgtta atgttagact atccctttgt gagtgacact ttaacagcat 2580
tcactgcttc tatatatagt gtaccatctt ggtcatacat tacgcctcaa catatacttg 2640
tgctcttcct ttgcctccag aagaagtttt tccttgattg tgctatgttt cagtggaaga 2700
aattctttga agtagatgtg agtgaaaaac tgcatgcctt tagaagccca gtatcagaac 2760
ttgctacgtt tcaggtgcta gggacttaat gaaaaacagg acaaaacaat tcctttttgt 2820
ggcccaggta aattatttct ggtttcactt ataattacta atggctgagt caagatgttg 2880
tctctgtgtt tgcttactct tgatcaagtg tgagacagtt tgaagactgt gctaccatac 2940
aaagtgaatg aagccagtga ctaagaaaca 2971



<210> 23 
<211> 1394 
<212> DNA 
<213> Humanus 

<400> 23 

actcactata gggctcgagc ggccgcccgg gcaggtgtct gcccgcagcc atgagcgtgc 60 

tcggccccgs tsgagyccry wgtsctctaa wkagtctcca ccgccgtcca ggacccactt 120 

gcagcatgga gtcgcccgcc tcgagccagc ccgccagcat gccccagtcc aaaggaaaat 180 

ccaagaggaa gaaggatcta cggatatcct gcatgtccaa gccacccgca cccaacccca 240 

cacccccccg gaacctggac tcccggacct tcatcaccat tggagacaga aactttgagg 300 

tggaggctga tgacttggtg accatctcag aactgggccg tggagcctat ggggtggtag 360 

agaaggtgcg gcacgcccag agcggcacca tcatggccgt gaagcggatc cgggccaccg 420 

tgaactcaca ggagcagaag cggctgctca tggacctgga catcaacatg cgcacggtcg 480 

actgtttcta cactgtcacc ttctacgggg cactattcag agagggagac gtgtggatct 540 

gcatggagct catggacaca tccttggaca agttctaccg gaaggtgctg gataaaaaca 600 

tgacaattcc agaggacatc cttggggaga ttgctgtgtc tatcgtgcgg gccctggagc 660 

atctgcacag caagctgtcg gtgatccaca gagatgtgaa gccctccaat gtccttatca 720 

acaaggaggg ccatgtgaag atgtgtgact ttggcatcag tggctacttg gtggactctg 780 

tggccaagac gatggatgcc ggctgcaagc cctacatggc ccctgagagg atcaacccag 840 

agctgaacca gaagggctac aatgtcaagt ccgacgtctg gagcctgggc atcaccatga 900 

ttgagatggc catcctgcgg ttcccttacg agtcctgggg gaccccgttc cagcagctga 960 

agcaggtggt ggaggagccg tccccccagc tcccagccga ccgtttctcc cccgagtttg 1020 

tggacttcac tgctcagtgc ctgaggaaga accccgcaga gcgtatgagc tacctggagc 1080 

tgattggagc tgatcgcttc tcccccaccc cctagggtac cagcaggcag agccttgccc 1140 

tctgctcagg ctggggtcca gtgggagggg cccaagatct ctgctcagag aagtgcaggg 1200 

ggagccttcc agctcactct ccctgaggac tggcttgaca ggggctatgg gtttgctttg 1260 

gtgttgtttt taaaaaaaga aaatatattt ttttgaaaaa acgactgccc atcccgggtc 1320 

ctttccctga tgggttgggg cagttacctg gttgctgttt taattaaaaa ctttagagca 1380 

caatggatct cgag 1394 



<210> 24
<211> 1717
<212> DNA
<213> Humanus

<400> 24

tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 60
ctgggtgaga gagcaagacc ctgtctcaac aaaaaatttt taaaaaataa aataaataat 120 
aatacagcaa aaagatttgc tttctcggct tcagtgtggg cggtaactcc atcgtgcaat 180 
gagaaaggcg aatttcttcc agacaccaat cccggaggtc gcttctgttg ctaggctccc 240 
agaaagcagg gttcggacgt cattgggagg cgaggctaga gcggggttgt gtgtggcgga 300 
gggaggcggg gctggaggaa acgctcgttg ctaaggaacg cagcgctctt cccgctctgg 360 
agaggcgcga ctgggcttgc gcagtgtcga cgccggcgcc ggcgcgccgg ggtttgaaag 420 
gcccgagcct cgcgcgcttg cgcactttag ccagcgcagg gcgcaccccg ctccctccca 480 






ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 540 
acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 600 
ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 660
caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 720 
tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 780 
gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 840 
agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 900 
gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 960 
aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1020 
cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1080 
aggatgagag cgccaactct ctgaggccgg gggctctctt cttcagccgg atcaacaaga 1140 
catccccagt cacagccagc gacccggcag gacctagcta tgcagctgct accctccagg 1200 
cttctagtgc tgcctcctca gcctctccag tctccagggc cataggcagc acctccaagc 1260 
cccaggagtc tccctggcat tcgttcgtcc ccgatggatc tacagttgca atgagaagca 1320 
gaagttactt cctcaccagc tctatggggt ggtgccgcaa gcctgaagta tgtgctatac 1380 
acacacacac acacacacac acacacacac acacacgatg catttaataa agatgagttg 1440 
gttctcatcc aagagtctcc caaaactcta agaggctccc tgggacctgg ggaagaatgc 1500 
tgggcacctc cgtcagagat ctggtacaca aggaactctt tgtctcttct gcttggcccc 1560 
ttatccctgt gttggcaaga ggcagggaac tgggaatctg accctcagca ctgcccctca 1620 
actttttctg gccctctgag ccacacctgt atcttggctg tccctttgtg gctggannnt 1680 
gggtacccat gaggcttgtc tctctcctga agcctca 1717 



<210> 25 
<211> 1346 
<212> DNA 
<213> Humanus 



<400> 25 

tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 60
ctgggtgaga gagcaagacc ctgtctcaac aaaaaatttt taaaaaataa aataaataat 120
aatacagcaa aaagatttgc tttctcggct tcagtgtggg cggtaactcc atcgtgcaat 180
gagaaaggcg aatttcttcc agacaccaat cccggaggtc gcttctgttg ctaggctccc 240
agaaagcagg gttcggacgt cattgggagg cgaggctaga gcggggttgt gtgtggcgga 300
gggaggcggg gctggaggaa acgctcgttg ctaaggaacg cagcgctctt cccgctctgg 360
agaggcgcga ctgggcttgc gcagtgtcga cgccggcgcc ggcgcgccgg ggtttgaaag 420
gcccgagcct cgcgcgcttg cgcactttag ccagcgcagg gcgcaccccg ctccctccca 480
ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 540
acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 600
ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 660
caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 720
tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 780
gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 840
agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 900
gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 960
aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1020
cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1080
aggatgagag cgccaactct ctgaggccgg gggctctctt cttcagccgg atcaacaaga 1140
catccccagt cacagccagc gacccggcag gacctagcta tgcagctgct accctccagg 1200
cttctagtgc tgcctcctca gcctctccag tctccagggc cataggcagc acctccaagc 1260
cccaggagtc atcagacttt ggtggggtgg aggaggagag aagctggagg cctcaatcca 1320
tccccatccc ctcagcccca gggagt 1346



<210> 26 
<211> 1643 
<212> DNA 
<213> Humanus 

<400> 26 

tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 60 






ctgggtgaga gagcaagacc ctgtctcaac aaaaaatttt taaaaaataa aataaataat 120
aatacagcaa aaagatttgc tttctcggct tcagtgtggg cggtaactcc atcgtgcaat 180
gagaaaggcg aatttcttcc agacaccaat cccggaggtc gcttctgttg ctaggctccc 240
agaaagcagg gttcggacgt cattgggagg cgaggctaga gcggggttgt gtgtggcgga 300
gggaggcggg gctggaggaa acgctcgttg ctaaggaacg cagcgctctt cccgctctgg 360
agaggcgcga ctgggcttgc gcagtgtcga cgccggcgcc ggcgcgccgg ggtttgaaag 420
gcccgagcct cgcgcgcttg cgcactttag ccagcgcagg gcgcaccccg ctccctccca 480
ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 540
acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 600
ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 660
caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 720
tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 780
gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 840
agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 900
gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 960
aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1020
cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1080
aggatgagag cgccaactct ctgaggctcg aggactatat gagtgaccgg gttcagtttg 1140
tgatcacagc acaggaatgg gatcccagct ttgaggaggc cctgatggac aacccctccc 1200
tggcattcgt tcgtccccga tggatctaca gttgcaatga gaagcagaag ttacttcctc 1260
accagctcta tggggtggtg ccgcaagcct gaagtatgtg ctatacacac acacacacac 1320
acacacacac acacacacac acgatgcatt taataaagat gagttggttc tcatccaaga 1380
gtctcccaaa actctaagag gctccctggg acctggggaa gaatgctggg cacctccgtc 1440
agagatctgg tacacaagga actctttgtc tcttctgctt ggccccttat ccctgtgttg 1500
gcaagaggca gggaactggg aatctgaccc tcagcactgc ccctcaactt tttctggccc 1560
tctgagccac acctgtatct tggctgtccc tttgtggctg gannntgggt acccatgagg 1620
cttgtctctc tcctgaagcc tca 1643



<210> 27 
<211> 2713 
<212> DNA 
<213> Humanus 



<400> 27 

tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 60
ctgggtgaga gagcaagacc ctgtctcaac aaaaaatttt taaaaaataa aataaataat 120
aatacagcaa aaagatttgc tttctcggct tcagtgtggg cggtaactcc atcgtgcaat 180
gagaaaggcg aatttcttcc agacaccaat cccggaggtc gcttctgttg ctaggctccc 240
agaaagcagg gttcggacgt cattgggagg cgaggctaga gcggggttgt gtgtggcgga 300
gggaggcggg gctggaggaa acgctcgttg ctaaggaacg cagcgctctt cccgctctgg 360
agaggcgcga ctgggcttgc gcagtgtcga cgccggcgcc ggcgcgccgg ggtttgaaag 420
gcccgagcct cgcgcgcttg cgcactttag ccagcgcagg gcgcaccccg ctccctccca 480
ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 540
acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 600
ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 660
caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 720
tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 780
gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 840
agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 900
gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 960
aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1020
cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1080
aggatgagag cgccaactct ctgaggccgg gggctctctt cttcagccgg atcaacaaga 1140
catccccagt cacagccagc gacccggcag gacctagcta tgcagctgct accctccagg 1200
cttctagtgc tgcctcctca gcctctccag tctccagggc cataggcagc acctccaagc 1260
cccaggagtc tcccaaaggg aagaggaagt tggatttgaa ccaagaagaa aagaagaccc 1320
ccagcaaacc accagcccag ctgtcgccat ctgttcccaa gagacctaaa ttgccagctc 1380
caactcgtac cccagccaca gccccagtcc ctgcccgagc acagggggca gtgacaggca 1440
aaccccgagg agaaggcacc gagcccagac gaccccgagc tggcccagag gagctgggga 1500
agatccttca gggtgtggta gtggtgctga gtggcttcca gaaccccttc cgctccgagc 1560






tgcgagataa ggccctagag cttggggcca agtatcggcc agactggacc cgggacagca 1620
cgcacctcat ctgtgccttt gccaacaccc ccaagtacag ccaggtccta ggcctgggag 1680
gccgcatcgt gcgtaaggag tgggtgctgg actgtcaccg catgcgtcgg cggctgccct 1740
cccggaggta cctcatggca gggccaggtt ccagcagtga ggaggatgag gcctctcaca 1800
gcggtggcag cggagatgaa gcccccaagc ttcctcagaa gcaaccccag accaaaacca 1860
agcccactca ggcagctgga cccagctcac cccagaagcc cccaacccct gaagagacca 1920
aagcagcctc accagtgctc caggaagata tagacattga gggggtacag tcagaaggac 1980
aggacaatgg ggcggaagat tctggggaca cagaggatga gctgaggagg gtggcagagc 2040
agaaggaaca cagactgccc cctggccagg aggagaatgg ggaagacccg tatgcaggct 2100
ccacggatga gaacacggac agtgaggaac accaggagcc tcctgatctg ccagtccctg 2160
agctccccag atttcttcca gggcaagcac ttctttcttt acggggagtt ccctggggac 2220
gagcggcgga aactcatccg atacgtcaca gccttcaatg gggagctccc tggcattcgt 2280
tcgtccccga tggatctaca gttgcaatga gaagcagaag ttacttcctc accagctcta 2340
tggggtggtg ccgcaagcct gaagtatgtg ctatacacac acacacacac acacacacac 2400
acacacacac acgatgcatt taataaagat gagttggttc tcatccaaga gtctcccaaa 2460
actctaagag gctccctggg acctggggaa gaatgctggg cacctccgtc agagatctgg 2520
tacacaagga actctttgtc tcttctgctt ggccccttat ccctgtgttg gcaagaggca 2580
gggaactggg aatctgaccc tcagcactgc ccctcaactt tttctggccc tctgagccac 2640
acctgtatct tggctgtccc tttgtggctg gannntgggt acccatgagg cttgtctctc 2700
tcctgaagcc tca 2713



<210> 28 
<211> 2143 
<212> DNA 
<213> Humanus 



<400> 28 

cggagtgccg ggtcgcgcct gcaccgaagg tcccggctcc tgtgccctcc ctgcagccgt 60
cagggaccgt cccccaactc ccctttccgc tcaggcaggg tcctcgcggc ccatgctggc 120
cgctggggac ccgcgcagcc cagaccgttc ccgggccggc cagccggcca ccatggtggc 180
cctgaggcct gtgcagcaac tccagggggg ctaaagggct cagagtgcag gccgtggggc 240
gcgagggtcc cgggcctgag ccccgcgcca tggccggggc catcgcttcc cgcatgagct 300
tcagctctct caagaggaag caacccaaga cgttcaccgt gaggatcgtc accatggacg 360
ccgagatgga gttcaattgc gagatgaagt ggaaagggaa ggacctcttt gatttggtgt 420
gccggactct ggggctccga gaaacctggt tctttggact gcagtacaca atcaaggaca 480
cagtggcctg gctcaaaatg gacaagaagg tactggatca tgatgtttca aaggaagaac 540
cagtcacctt tcacttcttg gccaaatttt atcctgagaa tgctgaagag gagctggttc 600
aggagatcac acaacattta ttcttcttac aggtaaagaa gcagatttta gatgaaaaga 660
tctactgccc tcctgaggct tctgtgctcc tggcttctta cgccgtccag gccaagtatg 720
gtgactacga ccccagtgtt cacaagcggg gatttttggc ccaagaggaa ttgcttccaa 780
aaagggtaat aaatctgtat cagatgactc cggaaatgtg ggaggagaga attactgctt 840
ggtacgcaga gcaccgaggc cgagccaggg atgaagctga aatggaatat ctgaagatag 900
ctcaggacct ggagatgtac ggtgtgaact actttgcaat ccggaataaa aagggcacag 960
agctgctgct tggagtggat gccctggggc ttcacattta tgaccctgag aacagactga 1020
cccccaagat ctccttcccg tggaaaaaat gaaatccgaa acatctcgta cagtgacaag 1080
gagtttacta ttaaaccact ggataagaaa attgatgtct tcaagtttaa ctcctcaaag 1140
cttcgtgtta ataagctgat tctccagcta tgtatcggga accatgatct atttatgagg 1200
agaaggaaag ccgattcttt ggaagttcag cagatgaaag cccaggccag ggaggagaag 1260
gctagaaagc agatgaaaga agaagcaaca atggccaacg aagcactgat gcggtctgag 1320
gagacagctg acctgttggc tgaaaaggcc cagatcaccg aggaggaggc aaaacttctg 1380
gcccagaagg ccgcagaggc tgagcaggaa atgcagcgca tcaaggccac agcgattcgc 1440
acggaggagg agaagcgcct gatggagcag aaggtgctgg aagccgaggt gctggcactg 1500
aagatggctg aggagtcaga gaggagggcc aaagaggcag atcagctgaa gcaggacctg 1560
caggaagcac gcgaggcgga gcgaagagcc aagcagaagc tcctggagat tgccaccaag 1620
cccacgtacc cgcccatgaa cccaattcca gcaccgttgc ctcctgacat accaagcttc 1680
aacctcattg gtgacagcct gtctttcgac ttcaaagata ctgacatgaa gcggctttcc 1740
atggagatag agaaagaaaa agtggaatac atggaaaaga gcaagcatct gcaggagcag 1800
ctcaatgaac tcaagacaga aatcgaggcc ttgaaactga aagagaggga gacagctctg 1860
gatattctgc acaatgagaa ctccgacagg ggtggcagca gcaagcacaa taccattaaa 1920
aagctcacct tgcagagcgc caagtcccga gtggccttct ttgaagagct ctagcaggtg 1980
acccagccac cccaggacct gccacttctc ctgctaccgg gaccgcggga tggaccagat 2040
atcaagagag ccatccatag ggagctggct gggggtttcc gtgggagctc cagaactttc 2100
cccagctgac atggaaatac aattgtctgc cggcctcccc tca 2143



<210> 29 
<211> 540 
<212> DNA 
<213> Humanus 



<400> 29 

gcgtagacgc gccgcgtccc cgcytgccgc tgctccgccg cagtcgccgc tccagtctat 60
ccggcactag gaacagcccc gagcggcgag acggtccccg ccatgtctgc ggccatgagg 120
gagaggttcg accggttcct gcacgagaag aactgcatga ctgaccttct ggccaagctc 180
gaggccaaaa ccggcgtgaa caggagcttc atcgctcttg gtgtcatcgg actggtggcc 240
ttgtacctgg tgttcggtta tggagcctct ctcctctgca acctgatagg atttggctac 300
ccagcctaca tctcaattaa agctatagag agtcccaaca aagaagatga tacccagtgg 360
ctgacctact gggtagtgta tggtgtgttc agcattgctg aattcttctc tgatatcttc 420
ctgtcatggt tccccttcta ctacatgctg aagcagattt atttagaacc tccatgtgcc 480
agattctgtt ctacatctgg gagatacttt ggcgaataaa acaaagtcct tgttgtcaaa 540



<210> 30 
<211> 4533 
<212> DNA 
<213> Humanus 

<400> 30 

cctactctat tcagatattc tccagattcc taaagattag agatcatttc tcattctcct 60 

aggagtactc acttcaggaa gcaaccagat aaaagagagg tgcaacggaa gccagaacat 120

tcctcctgga aattcaacct gtttcgcagt ttctcgagga atcagcattc agtcaatccg 180

ggccgggagc agtcatctgt ggtgaggctg attggctggg caggaacagc gccggggcgt 240

gggctgagca cagcgcttcg ctctctttgc cacaggaagc ctgagctcat tcgagtagcg 300

gctcttccaa gctcaaagaa gcagaggccg ctgttcgttt cctttaggtc tttccactaa 360

agtcggagta tcttcttcca agatttcacg tcttggtggc cgttccaagg agcgcgaggt 420

cgggatggat cttgaagggg accgcaatgg aggagcaaag aagaagaact tttttaaact 480

gaacaataaa agtgaaaaag ataagaagga aaagaaacca actgtcagtg tattttcaat 540

gtttcgctat tcaaattggc ttgacaagtt gtatatggtg gtgggaactt tggctgccat 600

catccatggg gctggacttc ctctcatgat gctggtgttt ggagaaatga cagatatctt 660

tgcaaatgca ggaaatttag aagatctgat gtcaaacatc actaatagaa gtgatatcaa 720

tgatacaggg ttcttcatga atctggagga agacatgacc aggtatgcct attattacag 780

tggaattggt gctggggtgc tggttgctgc ttacattcag gtttcatttt ggtgcctggc 840

agctggaaga caaatacaca aaattagaaa acagtttttt catgctataa tgcgacagga 900

gataggctgg tttgatgtgc acgatgttgg ggagcttaac acccgactta cagatgatgt 960 

ctctaagatt aatgaagtta ttggtgacaa aattggaatg ttctttcagt caatggcaac 1020 

atttttcact gggtttatag taggatttac acgtggttgg aagctaaccc ttgtgatttt 1080 

ggccatcagt cctgttcttg gactgtcagc tgctgtctgg gcaaagatac tatcttcatt 1140 

tactgataaa gaactcttag cgtatgcaaa agctggagca gtagctgaag aggtcttggc 1200 

agcaattaga actgtgattg catttggagg acaaaagaaa gaacttgaaa ggtacaacaa 1260 

aaatttagaa gaagctaaaa gaattgggat aaagaaagct attacagcca atatttctat 1320

aggtgctgct ttcctgctga tctatgcatc ttatgctctg gccttctggt atgggaccac 1380

cttggtcctc tcaggggaat attctattgg acaagtactc actgtattct tttctgtatt 1440

aattggggct tttagtgttg gacaggcatc tccaagcatt gaagcatttg caaatgcaag 1500

aggagcagct tatgaaatct tcaagataat tgataataag ccaagtattg acagctattc 1560

gaagagtggg cacaaaccag ataatattaa gggaaatttg gaattcagaa atgttcactt 1620

cagttaccca tctcgaaaag aagttaagat cttgaagggc ctgaacctga aggtgcagag 1680

tgggcagacg gtggccctgg ttggaaacag tggctgtggg aagagcacaa cagtccagct 1740

gatgcagagg ctctatgacc ccacagaggg gatggtcagt gttgatggac aggatattag 1800

gaccataaat gtaaggtttc tacgggaaat cattggtgtg gtgagtcagg aacctgtatt 1860 

gtttgccacc acgatagctg aaaacattcg ctatggccgt gaaaatgtca ccatggatga 1920 

gattgagaaa gctgtcaagg aagccaatgc ctatgacttt atcatgaaac tgcctcataa 1980

atttgacacc ctggttggag agagaggggc ccagttgagt ggtgggcaga agcagaggat 2040

cgccattgca cgtgccctgg ttcgcaaccc caagatcctc ctgctggatg aggccacgtc 2100

agccttggac acagaaagcg aagcagtggt tcaggtggct ctggataagg ccagaaaagg 2160

tcggaccacc attgtgatag ctcatcgttt gtctacagtt cgtaatgctg acgtcatcgc 2220

tggtttcgat gatggagtca ttgtggagaa aggaaatcat gatgaactca tgaaagagaa 2280

aggcatttac ttcaaacttg tcacaatgca gacagcagga aatgaagttg aattagaaaa 2340

tgcagctgat gaatccaaaa gtgaaattga tgccttggaa atgtcttcaa atgattcaag 2400

atccagtcta ataagaaaaa gatcaactcg taggagtgtc cgtggatcac aagcccaaga 2460

cagaaagctt agtaccaaag aggctctgga tgaaagtata cctccagttt ccttttggag 2520

gattatgaag ctaaatttaa ctgaatggcc ttattttgtt gttggtgtat tttgtgccat 2580

tataaatgga ggcctgcaac cagcatttgc aataatattt tcaaagatta taggggtttt 2640

tacaagaatt gatgatcctg aaacaaaacg acagaatagt aacttgtttt cactattgtt 2700

tctagccctt ggaattattt cttttattac atttttcctt cagggtttca catttggcaa 2760

agctggagag atcctcacca agcggctccg atacatggtt ttccgatcca tgctcagaca 2820

ggatgtgagt tggtttgatg accctaaaaa caccactgga gcattgacta ccaggctcgc 2880

caatgatgct gctcaagtta aaggggctat aggttccagg cttgctgtaa ttacccagaa 2940

tatagcaaat cttgggacag gaataattat atccttcatc tatggttggc aactaacact 3000

gttactctta gcaattgtac ccatcattgc aatagcagga gttgttgaaa tgaaaatgtt 3060

gtctggacaa gcactgaaag ataagaaaga actagaaggt gctgggaaga tcgctactga 3120

agcaatagaa aacttccgaa ccgttgtttc tttgactcag gagcagaagt ttgaacatat 3180

gtatgctcag agtttgcagg taccatacag aaactctttg aggaaagcac acatctttgg 3240

aattacattt tccttcaccc aggcaatgat gtatttttcc tatgctggat gtttccggtt 3300

tggagcctac ttggtggcac ataaactcat gagctttgag gatgttctgt tagtattttc 3360

agctgttgtc tttggtgcca tggccgtggg gcaagtcagt tcatttgctc ctgactatgc 3420

caaagccaaa atatcagcag cccacatcat catgatcatt gaaaaaaccc ctttgattga 3480

cagctacagc acggaaggcc taatgccgaa cacattggaa ggaaatgtca catttggtga 3540

agttgtattc aactatccca cccgaccgga catcccagtg cttcagggac tgagcctgga 3600

ggtgaagaag ggccagacgc tggctctggt gggcagcagt ggctgtggga agagcacagt 3660

ggtccagctc ctggagcggt tctacgaccc cttggcaggg aaagtgctgc ttgatggcaa 3720

agaaataaag cgactgaatg ttcagtggct ccgagcacac ctgggcatcg tgtcccagga 3780

gcccatcctg tttgactgca gcattgctga gaacattgcc tatggagaca acagccgggt 3840

ggtgtcacag gaagagatcg tgagggcagc aaaggaggcc aacatacatg ccttcatcga 3900

gtcactgcct aataaatata gcactaaagt aggagacaaa ggaactcagc tctctggtgg 3960

ccagaaacaa cgcattgcca tagctcgtgc ccttgttaga cagcctcata ttttgctttt 4020

ggatgaagcc acgtcagctc tggatacaga aagtgaaaag gttgtccaag aagccctgga 4080

caaagccaga gaaggccgca cctgcattgt gattgctcac cgcctgtcca ccatccagaa 4140

tgcagactta atagtggtgt ttcagaatgg cagagtcaag gagcatggca cgcatcagca 4200

gctgctggca cagaaaggca tctatttttc aatggtcagt gtccaggctg gaacaattta 4260

gaagacccaa tttggggttt attacggcat ttgaaatttg agagttagaa gtgacacctg 4320

tcttgatgaa gtattgaaac aacaggctag ctctccctaa gttattagaa acatccagaa 4380

aaagaacaaa attcaaggta agtacaggtg tagaaatata gaagccacag acatagggaa 4440

atggtcttat ggtcctgaat gggttgggta gacaggaaag tttgggaata attttcccaa 4500

gtacaaaaga aaaataaatt gttcaatagc ctc 4533



<210> 31 
<211> 2307 
<212> DNA 
<213> Humanus 



<400> 31 

cctactctat tcagatattc tccagattcc taaagattag agatcatttc tcattctcct 60
aggagtactc acttcaggaa gcaaccagat aaaagagagg tgcaacggaa gccagaacat 120
tcctcctgga aattcaacct gtttcgcagt ttctcgagga atcagcattc agtcaatccg 180
ggccgggagc agtcatctgt ggtgaggctg attggctggg caggaacagc gccggggcgt 240
gggctgagca cagcgcttcg ctctctttgc cacaggaagc ctgagctcat tcgagtagcg 300
gctcttccaa gctcaaagaa gcagaggccg ctgttcgttt cctttaggtc tttccactaa 360
agtcggagta tcttcttcca agatttcacg tcttggtggc cgttccaagg agcgcgaggt 420
cgggatggat cttgaagggg accgcaatgg aggagcaaag aagaagaact tttttaaact 480
gaacaataaa agtgaaaaag ataagaagga aaagaaacca actgtcagtg tattttcaat 540
gtttcgctat tcaaattggc ttgacaagtt gtatatggtg gtgggaactt tggctgccat 600
catccatggg gctggacttc ctctcatgat gctggtgttt ggagaaatga cagatatctt 660
tgcaaatgca ggaaatttag aagatctgat gtcaaacatc actaatagaa gtgatatcaa 720
tgatacaggg ttcttcatga atctggagga agacatgacc aggtatgcct attattacag 780
tggaattggt gctggggtgc tggttgctgc ttacattcag gtttcatttt ggtgcctggc 840
agctggaaga caaatacaca aaattagaaa acagtttttt catgctataa tgcgacagga 900
gataggctgg tttgatgtgc acgatgttgg ggagcttaac acccgactta cagatgatgt 960
ctccaagatt aatgaaggaa ttggtgacaa aattggaatg ttctttcagt caatggcaac 1020
atttttcact gggtttatag taggatttac acgtggttgg aagctaaccc ttgtgatttt 1080
ggccatcagt cctgttcttg gactgtcagc tgctgtctgg gcaaagatac tatcttcatt 1140
tactgataaa gaactcttag cgtatgcaaa agctggagca gtagctgaag aggtcttggc 1200
agcaattaga actgtgattg catttggagg acaaaagaaa gaacttgaaa ggtacaacaa 1260
aaatttagaa gaagctaaaa gaattgggat aaagaaagct attacagcca atatttctat 1320
aggtgctgct ttcctgctga tctatgcatc ttatgctctg gccttctggt atgggaccac 1380
cttggtcctc tcaggggaat attctattgg acaagtactc actgtattyt tttctgtatt 1440
aattggggct tttagtgttg gacaggcatc tccaagcatt gaagcatttg caaatgcaag 1500
aggagcagct tatgaaatct tcaagataat tgataataag ccaagtattg acagctattc 1560
gaagagtggg cacaaaccag ataatattaa gggaaatttg gaattcagaa atgttcactt 1620
cagttaccca tctcgaaaag aagttaagat cttgaagggc ctgaacctga aggtgcagag 1680
tgggcagacg gtggccctgg ttggaaacag tggctgtggg aagagcacaa cagtccagct 1740
gatgcagagg ctctatgacc ccacagaggg gatggtcagt gttgatggac aggatattag 1800
gaccataaat gtaaggtttc tacgggaaat cattggtgtg gtgagtcagg aacctgtatt 1860
gtttgccacc acgatagctg aaaacattcg ctatggccgt gaaaatgtca ccatggatga 1920
gattgagaaa gctgtcaagg aagccaatgc ctatgacttt atcatgaaac tgcctcataa 1980
atttgacacc ctggttggag agagaggggc ccagctgagt ggtgggcaga agcagaggat 2040
cgccattgca cgtgccctgg ttcgcaaccc caagatcctt ctgctggatg aggccacgtc 2100
agcattggac acagaaagtg aagctgaggt acaggcagct ctggataagg tcagtagact 2160
ctaaaaagct gaaggaccac cacattgaaa cctattgaag attcttgcca gtgcttccgg 2220
agtctgggct gagaaacaga aacatagcaa atggagctac ctcatggagc tgtattgatt 2280
tctcctttct gtcaggttac attgtcc 2307



<210> 32 
<211> 1712 
<212> DNA 
<213> Humanus 

<400> 32 

gcttggattg cggagccgcg agcagcgctg ggtaacggcc gcggcgacca ccccggacgg 60 

cccctgtccc cgctggcggg cttccctgtc gccgttcgct gcgctgccgg cttcttggtg 120 

aatttttgga tgaagccatt aaattaattg cttgccatca tgagcagaag caagcgtgac 180 

aacaattttt atagtgtaga gattggagat tctacattca cagtcctgaa acgatatcag 240 

aatttaaaac ctataggctc aggagctcaa ggaatagtat gcgcagctta tgatgccatt 300 

cttgaaagaa atgttgcaat caagaagcta agccgaccat ttcagaatca gactcatgcc 360 

aagcgggcct acagagagct agttcttatg aaatgtgtta atcacaaaaa tataattggc 420 

cttttgaatg ttttcacacc acagaaatcc ctagaagaat ttcaagatgt ttacatagtc 480 

atggagctca tggatgcaaa tctttgccaa gtgattcaga tggagctaga tcatgaaaga 540 

atgtcctacc ttctctatca gatgctgtgt ggaatcaagc accttcattc tgctggaatt 600

attcatcggg acttaaagcc cagtaatata gtagtaaaat ctgattgcac tttgaagatt 660 

cttgacttcg gtctggccag gactgcagga acgagtttta tgatgacgcc ttatgtagtg 720 

actcgctact acagagcacc cgaggtcatc cttggcatgg gctacaagga aaacggagga 780 

agaatgggaa aaggcatatt cacaaggtta caataaggtg cctgtgagat ataaaattta 840 

taactgccac atcctttctt aggaattttt aaatttctat tttcttgtaa tatgaataca 900 

agaatacatt cttgtaaatg aatgtattga acattagtta tggagtattt ttcttagcta 960 

cttgatatta gatattgatc agtggaataa agttattgaa cagcttggaa caccatgtcc 1020 

tgaattcatg aagaaactgc aaccaacagt aaggacttac gttgaaaaca gacctaaata 1080 

tgctggatat agctttgaga aactcttccc tgatgtcctt ttcccagctg actcagaaca 1140 

caacaaactt aaagccagtc aggcaaggga tttgttatcc aaaatgctgg taatagatgc 1200 

atctaaaagg atctctgtag atgaagctct ccaacacccg tacatcaatg tctggtatga 1260 

tccttctgaa gcagaagctc caccaccaaa gatccctgac aagcagttag atgaaaggga 1320 

acacacaata gaagagtgga aagaattgat atataaggaa gttatggact tggaggagag 1380 

aaccaagaat ggagttatac gggggcagcc ctctccttta gcacaggtgc agcagtgatc 1440

aatggctctc agcatccatc atcatcgtcg tctgtcaatg atgtgtcttc aatgtcaaca 1500

gatccgactt tggcctctga tacagacagc agtctagaag cagcagctgg gcctctgggc 1560

tgctgtagat gactacttgg gccatcgggg ggtgggaggg atggggagtc ggttagtcat 1620

tgatagaact actttgaaaa caattcagtg gtcttatttt tgggtgattt ttcaaaaaat 1680

gtaggatttc attttgtagt aaagtagttt



<210> 33
<211> 1068 
<212> DNA 
<213> Humanus 



<400> 33 .^f« ar a araattttta taqtgtagag 60 

cattaattgc ttgccatcat gagcagaagc aagcgtgaca «««tttt g g SJJ ^ 

attggagatt ctacattcac agtcctgaaa cgatateag. Jjtt tgttgcaatc 180 

ggagctcaag gaatagtatg cgcagcttat g^gcoattc ttg g cagagagcta 240 
aagaagctaa gccgaccatt tcagaatcag *£catgcca ag gg tttcacacca 300 

gttcttatga aatgtgttaa tcacaaaaat ataattggcc tt <3 ^ ggatgcaaat 360 
cagaaatccc tagaagaatt tcaagatgtt = ;* tatcctacct tctctatcag 420 

ct?tgccaag tgattcagat ; Wjjctagat catgaaagaa tgtccta 48 
atgctgtgtg gaatcaagca «ttcattct g f ttgactt cgg tctggccagg 540 

agtaatatag tagtaaaatc tgattgcact £tg g _ y tacta cagagcaccc 600 

actgcaggaa cgagttttat Jjtgacgcct t.tgtagtg. £cg gg ? gcatt 660 

gaggtcatcc ttggcatggg cta " a ^ a ^ a3 ^ ggaa gggac tatat tgatcagtgg 720 
atgggagaaa tggtttgcca jaaaatccto "tccagg actgcaaC ca 7 80 

aataaagtta ttgaacagct t99 aa «"J aaatatqctg gatatagctt tgagaaactc 840 

;r=nsn; r=snuu = i£ »~ 
susss susss s , , , =? - s 

gggaaaaaaa tgaggttttt gttttttttt tctttaatct tatatatt 



<210> 34 
<211> 1388 
<212> DNA 
<213> Humanus 



tt^tcc -nj- imlllttl llllllllll llo 

.»„.,.«: =t,=,« r c xuss «r»" g ;r. t«.4L.tc iso 

ggagctcaag gaatagtatg cgcagctcat y * aflcaDac cta cagagagcta 240 

lagaagctaa gccgaccatt tcagaatcag "tcatgcca J J 3Q0 

gttcttatga aatgtgttaa tcacaaaaat ataa "gg^ ""J^ c | t ggat gcaaat 360 
cagaaatccc tagaagaatt tcaagatgtt tacatagtca Jggagctcat gg J 42Q 
ct«gccaag tgattcagat ggagctagat "tgaaagaa Jgtcctac 48Q 
atgctgtgtg gaatcaagca -ttcattct ? ctggaatta ttcatcggg 5 
agtaatatag tagtaaaatc tgattgcact "^aag y qctacta cagagcaccc 600 

actgcaggaa cgagttttat gatgacgcct ta ^agtga £cgcta g 9 g6() 

gaggtcatcc ttggcatggg ctacaaggaa aa ^ggatt tatggtct^ JWJJ^ 72Q 
atgggagaaa tggtttgcca caaaatcctc tttccagg ««« act gcaacca 780 

aa?aaagtta ttgaacagct tggaacacca tgtcctgaat tcatgaagaa 9 84Q 
acagtaagga cttacgttga aaacagacct -atatgctg ^tatagctt J g00 
ttccctgatg tccttttccc agctgactca gaacacaaca £ tgtagatgaa 960 

agggat?tgt tatccaaaat gctggtaata gatgcatcta ««9gatctc J J J 1Q2Q 
gctctccaac acccgtacat caatgtctgg tatgatcctt ^gaagcaga J J 1Q80 
?gtaagttat tttcttaatg tttacagaac atattgcatt ="agagtta 9 J J 114Q 
taggt?tgga ggagaccttt taattttaaa a aaaaa ^ g a ^^tcga cccctccta 1200 
ttctgtttct tcatgaagac tacg caaa ^ggatacc Laccagaga 1260 

V££S£ "-go- gggggaaca^ cct^tcag gta^aaaaca aattactgta 1320 






tccaaaggaa aacagaattc tgtgatctgt gatataaata aaatgtggca atttcaagag 1380 
ctagaaga 1388 



<210> 35 
<211> 501 
<212> PRT 
<213> Humanus

<400> 35 

Met Val Arg Ser Gly Asn Lys Ala Ala Val Val Leu Cys Met Asp Val 
1 5 10 15 

Gly Phe Thr Met Ser Asn Ser Ile Pro Gly Ile Glu Ser Pro Phe Glu
20 25 30

Gln Ala Lys Lys Val Ile Thr Met Phe Val Gln Arg Gln Val Phe Ala
35 40 45

Glu Asn Lys Asp Glu Ile Ala Leu Val Leu Phe Gly Thr Asp Gly Thr
50 55 60

Asp Asn Pro Leu Ser Gly Gly Asp Gln Tyr Gln Asn Ile Thr Val His
65 70 75 80

Arg His Leu Met Leu Pro Asp Phe Asp Leu Leu Glu Asp Ile Glu Ser
85 90 95

Ile Val Ser Met Asp Val Ile Gln His Glu Thr Ile Gly Lys Lys Phe
100 105 110

Glu Lys Arg His Ile Glu Ile Phe Thr Asp Leu Ser Ser Arg Phe Ser
115 120 125

Lys Ser Gln Leu Asp Ile Ile Ile His Ser Leu Lys Lys Cys Asp Ile
130 135 140

Ser Leu Gln Phe Phe Leu Pro Phe Ser Leu Gly Lys Glu Asp Gly Ser
145 150 155 160

Gly Asp Arg Gly Asp Gly Pro Phe Arg Leu Gly Gly His Gly Pro Ser
165 170 175

Ile Val Lys Met Val Met Ile Ser Leu Glu Gly Glu Asp Gly Leu Asp
180 185 190

Glu Ile Tyr Ser Phe Ser Glu Ser Leu Arg Lys Leu Cys Val Phe Lys
195 200 205

Lys Ile Glu Arg His Ser Ile His Trp Pro Cys Arg Leu Thr Ile Gly
210 215 220

Ser Asn Leu Ser Ile Arg Ile Ala Ala Tyr Lys Ser Ile Leu Gln Glu
225 230 235 240

Arg Val Lys Lys Thr Trp Thr Val Val Asp Ala Lys Thr Leu Lys Lys
245 250 255

Glu Thr Glu Val Leu Lys Glu Asp Ile Ile Gln Gly Phe Arg Tyr Gly
260 265 270

Ser Asp Ile Val Pro Phe Ser Lys Val Asp Glu Glu Gln Met Lys Tyr
275 280 285

Lys Ser Glu Gly Lys Cys Phe Ser Val Leu Gly Phe Cys Lys Ser Ser
290 295 300

Gln Val Gln Arg Arg Phe Phe Met Gly Asn Gln Val Leu Lys Val Phe
305 310 315 320

Ala Ala Arg Asp Asp Glu Ala Ala Ala Val Ala Leu Ser Ser Leu Ile
325 330 335

His Ala Leu Asp Asp Leu Asp Met Val Ala Ile Val Arg Tyr Ala Tyr
340 345 350

Lys His Asn Tyr Glu Cys Leu Val Tyr Val Gln Leu Pro Phe Met Glu
355 360 365

Asp Leu Arg Gln Tyr Met Phe Ser Ser Leu Lys Asn Ser Lys Lys Tyr
370 375 380

Ala Pro Thr Glu Ala Gln Leu Asn Ala Val Asp Ala Leu Ile Asp Ser
385 390 395 400

Met Ser Leu Ala Lys Lys Asp Glu Lys Thr Asp Thr Leu Glu Asp Leu
405 410 415

Phe Pro Thr Thr Lys Ile Pro Asn Pro Arg Phe Gln Arg Leu Phe Gln
420 425 430

Val Arg Glu Glu Gly 
435 



<210> 36 
<211> 521 
<212> PRT 
<213> Humanus 

<400> 36

Met Val Arg Ser Gly Asn Lys Ala Ala Val Val Leu Cys Met Asp Val
1 5 10 15

Gly Phe Thr Met Ser Asn Ser Ile Pro Gly Ile Glu Ser Pro Phe Glu
20 25 30

Gln Ala Lys Lys Val Ile Thr Met Phe Val Gln Arg Gln Val Phe Ala
35 40 45

Glu Asn Lys Asp Glu Ile Ala Leu Val Leu Phe Gly Thr Asp Gly Thr
50 55 60

Asp Asn Pro Leu Ser Gly Gly Asp Gln Tyr Gln Asn Ile Thr Val His
65 70 75 80

Arg His Leu Met Leu Pro Asp Phe Asp Leu Leu Glu Asp Ile Glu Ser
85 90 95

Lys Ile Gln Pro Gly Ser Gln Gln Ala Asp Phe Leu Asp Ala Leu Ile
100 105 110

Val Ser Met Asp Val Ile Gln His Glu Thr Ile Gly Lys Lys Phe Glu
115 120 125

Lys Arg His Ile Glu Ile Phe Thr Asp Leu Ser Ser Arg Phe Ser Lys
130 135 140

Ser Gln Leu Asp Ile Ile Ile His Ser Leu Lys Lys Cys Asp Ile Ser
145 150 155 160

Leu Gln Phe Phe Leu Pro Phe Ser Leu Gly Lys Glu Asp Gly Ser Gly
165 170 175

Asp Arg Gly Asp Gly Pro Phe Arg Leu Gly Gly His Gly Pro Ser Phe
180 185 190

Pro Leu Lys Gly Ile Thr Glu Gln Gln Lys Glu Gly Leu Glu Ile Val
195 200 205

Lys Met Val Met Ile Ser Leu Glu Gly Glu Asp Gly Leu Asp Glu Ile
210 215 220

Tyr Ser Phe Ser Glu Ser Leu Arg Lys Leu Cys Val Phe Lys Lys Ile
225 230 235 240

Glu Arg His Ser Ile His Trp Pro Cys Arg Leu Thr Ile Gly Ser Asn
245 250 255

Leu Ser Ile Arg Ile Ala Ala Tyr Lys Ser Ile Leu Gln Glu Arg Val
260 265 270

Lys Lys Thr Trp Thr Val Val Asp Ala Lys Thr Leu Lys Lys Glu Asp
275 280 285

Ile Gln Lys Glu Thr Val Tyr Cys Leu Asn Asp Asp Asp Glu Thr Glu
290 295 300

Leu Asn Pro Pro Ala Glu Val Thr Thr Lys Ser Gln Ile Pro Leu Ser
305 310 315 320

Lys Ile Lys Thr Leu Phe Pro Leu Ile Glu Ala Lys Lys Lys Asp Gln
325 330 335

Val Thr Ala Gln Glu Ile Phe Gln Asp Asn His Glu Asp Gly Pro Thr
340 345 350

Ala Lys Lys Leu Lys Thr Glu Gln Gly Gly Ala His Phe Ser Val Ser
355 360 365

Ser Leu Ala Glu Gly Ser Val Thr Ser Val Gly Ser Val Asn Pro Ala
370 375 380

Glu Asn Phe Arg Val Leu Val Lys Gln Lys Lys Ala Ser Phe Glu Glu
385 390 395 400

Ala Ser Asn Gln Leu Ile Asn His Ile Glu Gln Phe Leu Asp Thr Asn
405 410 415

Glu Thr Pro Tyr Phe Met Lys Ser Ile Asp Cys Ile Arg Ala Phe Arg
420 425 430

Glu Glu Ala Ile Lys Phe Ser Glu Glu Gln Arg Phe Asn Asn Phe Leu
435 440 445

Lys Ala Leu Gln Glu Lys Val Glu Ile Lys Gln Leu Asn His Phe Trp
450 455 460

Glu Ile Val Val Gln Asp Gly Ile Thr Leu Ile Thr Lys Glu Glu Ala
465 470 475 480

Ser Gly Ser Ser Val Thr Ala Glu Glu Ala Lys Lys Phe Leu Ala Pro
485 490 495

Lys Asp Lys Pro Ser Gly Asp Thr Ala Ala Val Phe Glu Glu Gly Gly
500 505 510

Asp Val Asp Asp Leu Leu Asp Met Ile
515 520



<210> 37 
<211> 437 
<212> PRT 
<213> Humanus 

<400> 37 

Met Gly Cys Gly Cys Ser Ser His Pro Glu Asp Asp Trp Met Glu Asn
1 5 10 15

Ile Asp Val Cys Glu Asn Cys His Tyr Pro Ile Val Pro Leu Asp Gly
20 25 30

Lys Gly Thr Leu Leu Ile Arg Asn Gly Ser Glu Val Arg Asp Pro Leu
35 40 45

Val Thr Tyr Glu Gly Ser Asn Pro Pro Ala Ser Pro Leu Gln Asp Asn
50 55 60

Leu Val Ile Ala Leu His Ser Tyr Glu Pro Ser His Asp Gly Asp Leu
65 70 75 80

Gly Phe Glu Lys Gly Glu Gln Leu Arg Ile Leu Glu Gln Ser Gly Glu
85 90 95

Trp Trp Lys Ala Gln Ser Leu Thr Thr Gly Gln Glu Gly Phe Ile Pro
100 105 110

Phe Asn Phe Val Ala Lys Ala Asn Ser Leu Glu Pro Glu Pro Trp Phe
115 120 125

Phe Lys Asn Leu Ser Arg Lys Asp Ala Glu Arg Gln Leu Leu Ala Pro
130 135 140

Gly Asn Thr His Gly Ser Phe Leu Ile Arg Glu Ser Glu Ser Thr Ala
145 150 155 160

Gly Ser Phe Ser Leu Ser Val Arg Asp Phe Asp Gln Asn Gln Gly Glu
165 170 175

Val Val Lys His Tyr Lys Ile Arg Asn Leu Asp Asn Gly Gly Phe Tyr
180 185 190

Ile Ser Pro Arg Ile Thr Phe Pro Gly Leu His Glu Leu Val Arg His
195 200 205

Tyr Thr Asn Ala Ser Asp Gly Leu Cys Thr Arg Leu Ser Arg Pro Cys
210 215 220

Gln Thr Gln Lys Pro Gln Lys Pro Trp Trp Glu Asp Glu Trp Glu Val
225 230 235 240

Pro Arg Glu Thr Leu Lys Leu Val Glu Arg Leu Gly Ala Gly Gln Phe
245 250 255

Gly Glu Val Trp Met Gly Tyr Tyr Asn Gly His Thr Lys Val Ala Val
260 265 270

Lys Ser Leu Lys Gln Gly Ser Met Ser Pro Asp Ala Phe Leu Ala Glu
275 280 285

Ala Asn Leu Met Lys Gln Leu Gln His Gln Arg Leu Val Arg Leu Tyr
290 295 300

Ala Val Val Thr Gln Glu Pro Ile Tyr Ile Ile Thr Glu Tyr Met Glu
305 310 315 320

Asn Gly Ser Leu Val Asp Phe Leu Lys Thr Pro Ser Gly Ile Lys Leu
325 330 335

Thr Ile Asn Lys Leu Leu Asp Met Ala Ala Gln Ile Ala Glu Gly Met
340 345 350

Ala Phe Ile Glu Glu Arg Asn Tyr Ile His Arg Asp Leu Arg Ala Ala
355 360 365

Asn Ile Leu Val Ser Asp Thr Leu Ser Cys Lys Ile Ala Asp Phe Gly
370 375 380

Leu Ala Arg Leu Ile Glu Asp Ile His His Gln Val Arg Cys Val Val
385 390 395 400

Phe Trp Asp Pro Ala Asp Gly Asn Cys His Pro Arg Pro His Pro Leu
405 410 415

Pro Arg Asp Asp Gln Pro Gly Gly Asp Ser Glu Pro Gly Ala Arg Leu
420 425 430

Pro His Gly Ala Pro
435



<210> 38 
<211> 567 
<212> PRT 
<213> Humanus 



<400> 38 

Met Gly Cys Gly Cys Ser Ser His Pro Glu Asp Asp Trp Met Glu Asn 

1 5 10 15

Ile Asp Val Cys Glu Asn Cys His Tyr Pro Ile Val Pro Leu Asp Gly
20 25 30

Lys Gly Thr Leu Leu Ile Arg Asn Gly Ser Glu Val Arg Asp Pro Leu
35 40 45

Val Thr Tyr Glu Gly Ser Asn Pro Pro Ala Ser Pro Leu Gln Gly Asp
50 55 60

Pro Arg Gln Gln Gly Leu Lys Asp Lys Ala Cys Gly Ser Leu Ala Val
65 70 75 80

Gly Phe His Leu Ser Pro Thr Tyr Phe Leu Pro Gly Leu Ala Phe Leu
85 90 95

Val Pro His Pro Val Thr Pro Gly Phe Leu Pro Ile Pro Ala Arg Phe
100 105 110

Ser Leu Thr Pro Leu Val Phe Thr Asp Asn Leu Val Ile Ala Leu His
115 120 125

Ser Tyr Glu Pro Ser His Asp Gly Asp Leu Gly Phe Glu Lys Gly Glu
130 135 140

Gln Leu Arg Ile Leu Glu Gln Ser Gly Glu Trp Trp Lys Ala Gln Ser
145 150 155 160

Leu Thr Thr Gly Gln Glu Gly Phe Ile Pro Phe Asn Phe Val Ala Lys
165 170 175

Ala Asn Ser Leu Glu Pro Glu Pro Trp Phe Phe Lys Asn Leu Ser Arg
180 185 190

Lys Asp Ala Glu Arg Gln Leu Leu Ala Pro Gly Asn Thr His Gly Ser
195 200 205

Phe Leu Ile Arg Glu Ser Glu Ser Thr Ala Gly Ser Phe Ser Leu Ser
210 215 220

Val Arg Asp Phe Asp Gln Asn Gln Gly Glu Val Val Lys His Tyr Lys
225 230 235 240

Ile Arg Asn Leu Asp Asn Gly Gly Phe Tyr Ile Ser Pro Arg Ile Thr
245 250 255

Phe Pro Gly Leu His Glu Leu Val Arg His Tyr Thr Asn Ala Ser Asp
260 265 270

Gly Leu Cys Thr Arg Leu Ser Arg Pro Cys Gln Thr Gln Lys Pro Gln
275 280 285

Lys Pro Trp Trp Glu Asp Glu Trp Glu Val Pro Arg Glu Thr Leu Lys
290 295 300

Leu Val Glu Arg Leu Gly Ala Gly Gln Phe Gly Glu Val Trp Met Gly
305 310 315 320

Tyr Tyr Asn Gly His Thr Lys Val Ala Val Lys Ser Leu Lys Gln Gly
325 330 335

Ser Met Ser Pro Asp Ala Phe Leu Ala Glu Ala Asn Leu Met Lys Gln
340 345 350

Leu Gln His Gln Arg Leu Val Arg Leu Tyr Ala Val Val Thr Gln Glu
355 360 365

Pro Ile Tyr Ile Ile Thr Glu Tyr Met Glu Asn Gly Ser Leu Val Asp
370 375 380

Phe Leu Lys Thr Pro Ser Gly Ile Lys Leu Thr Ile Asn Lys Leu Leu
385 390 395 400

Asp Met Ala Ala Gln Ile Ala Glu Gly Met Ala Phe Ile Glu Glu Arg
405 410 415

Asn Tyr Ile His Arg Asp Leu Arg Ala Ala Asn Ile Leu Val Ser Asp
420 425 430

Thr Leu Ser Cys Lys Ile Ala Asp Phe Gly Leu Ala Arg Leu Ile Glu
435 440 445

Asp Asn Glu Tyr Thr Ala Arg Glu Gly Ala Lys Phe Pro Ile Lys Trp
450 455 460

Thr Ala Pro Glu Ala Ile Asn Tyr Gly Thr Phe Thr Ile Lys Ser Asp
465 470 475 480

Val Trp Ser Phe Gly Ile Leu Leu Thr Glu Ile Val Thr His Gly Arg
485 490 495

Ile Pro Tyr Pro Gly Met Thr Asn Pro Glu Val Ile Gln Asn Leu Glu
500 505 510

Arg Gly Tyr Arg Met Val Arg Pro Asp Asn Cys Pro Glu Glu Leu Tyr
515 520 525

Gln Leu Met Arg Leu Cys Trp Lys Glu Arg Pro Glu Asp Arg Pro Thr
530 535 540

Phe Asp Tyr Leu Arg Ser Val Leu Glu Asp Phe Phe Thr Ala Thr Glu
545 550 555 560

Gly Gln Tyr Gln Pro Gln Pro
565 



<210> 39 
<211> 192 
<212> PRT 
<213> Humanus 

<400> 39 

Met Arg Ile Ala Val Ile Cys Phe Cys Leu Leu Gly Ile Thr Cys Ala
1 5 10 15

Ile Pro Val Lys Gln Ala Asp Ser Gly Ser Ser Glu Glu Lys Gln Leu
20 25 30

Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr Trp Leu Asn Pro Asp Pro
35 40 45

Ser Gln Lys Gln Asn Leu Leu Ala Pro Gln Asn Ala Val Ser Ser Glu
50 55 60

Glu Thr Asn Asp Phe Lys Gln Glu Thr Leu Pro Ser Lys Ser Asn Glu
65 70 75 80

Ser His Asp His Met Asp Asp Met Asp Asp Glu Asp Asp Asp Asp His
85 90 95

Val Asp Ser Gln Asp Ser Ile Asp Ser Asn Asp Ser Asp Asp Val Asp
100 105 110

Asp Thr Asp Asp Ser His Gln Ser Asp Glu Ser His His Ser Asp Glu
115 120 125

Ser Asp Glu Leu Val Thr Asp Phe Pro Thr Asp Leu Pro Ala Thr Glu
130 135 140

Val Phe Thr Pro Val Val Pro Thr Val Asp Thr Tyr Asp Gly Arg Gly
145 150 155 160

Asp Ser Val Val Tyr Gly Leu Arg Ser Lys Ser Lys Lys Phe Arg Arg
165 170 175

Pro Asp Ile Gln Val Asn Pro Leu Thr Asp Thr Pro Asp Gly Ser Asp
180 185 190



<210> 40 
<211> 109 
<212> PRT 
<213> Humanus 

<400> 40 

Met Glu Leu Gly Leu Pro Gln Val Pro Pro Ala Val Asp Ala Glu Leu
1 5 10 15

Leu Cys Arg Phe Val Asp Arg Gly Leu Pro Tyr Pro Asp Val Ser Ser
20 25 30

Ala Asn Thr Pro Pro Ala Val Gly Leu Ser Pro Pro Thr Pro Tyr Phe
35 40 45

Glu Pro Cys Ala Leu Pro Ser Pro His Arg His Gln Leu Ala Glu Ala
50 55 60

Ile Pro Cys Thr Leu Ala Val Ser Asn Pro His Thr Asp Ala Trp Lys
65 70 75 80

Ser His Gly Leu Val Glu Val Ala Ser Tyr Cys Glu Glu Ser Arg Gly
85 90 95

Asn Asn Gln Trp Val Pro Tyr Ile Ser Leu Gln Glu Arg
100 105 



<210> 41 
<211> 331
<212> PRT 
<213> Humanus 



<400> 41

Met Arg Ala Arg Pro Gln Val Cys Glu Ala Leu Leu Phe Ala Leu Ala
1 5 10 15

Leu Gln Thr Gly Val Cys Tyr Gly Ile Lys Trp Leu Ala Leu Ser Lys
20 25 30

Thr Pro Ser Ala Leu Ala Leu Asn Gln Thr Gln His Cys Lys Gln Leu
35 40 45

Glu Gly Leu Val Ser Ala Gln Val Gln Leu Cys Arg Ser Asn Leu Glu
50 55 60

Leu Met His Thr Val Val His Ala Ala Arg Glu Val Met Lys Ala Cys
65 70 75 80

Arg Arg Ala Phe Ala Asp Met Arg Trp Asn Cys Ser Ser Ile Glu Leu
85 90 95

Ala Pro Asn Tyr Leu Leu Asp Leu Glu Arg Gly Thr Arg Glu Ser Ala
100 105 110

Phe Val Tyr Ala Leu Ser Ala Ala Ala Ile Ser His Ala Ile Ala Arg
115 120 125

Ala Cys Thr Ser Gly Asp Leu Pro Gly Cys Ser Cys Gly Pro Val Pro
130 135 140

Gly Glu Pro Pro Gly Pro Gly Asn Arg Trp Gly Arg Cys Ala Asp Asn
145 150 155 160

Leu Ser Tyr Gly Leu Leu Met Gly Ala Lys Phe Ser Asp Ala Pro Met
165 170 175

Lys Val Lys Lys Thr Gly Ser Gln Ala Asn Lys Leu Met Arg Leu His
180 185 190

Asn Ser Glu Val Gly Arg Gln Ala Leu Arg Ala Ser Leu Glu Met Lys
195 200 205

Cys Lys Cys His Gly Val Ser Gly Ser Cys Ser Ile Arg Thr Cys Trp
210 215 220

Lys Gly Leu Gln Glu Leu Gln Asp Val Ala Ala Asp Leu Lys Thr Arg
225 230 235 240

Tyr Leu Ser Ala Thr Lys Val Val His Arg Pro Met Gly Thr Arg Lys
245 250 255

His Leu Val Pro Lys Asp Leu Asp Ile Arg Pro Val Lys Asp Ser Glu
260 265 270

Leu Val Tyr Leu Gln Ser Ser Pro Asp Phe Cys Met Lys Asn Glu Lys
275 280 285

Val Gly Ser His Gly Thr Gln Asp Arg Gln Cys Asn Lys Thr Ser Asn
290 295 300

Gly Ser Asp Ser Cys Asp Leu Met Cys Cys Tyr Val Thr Cys Arg Arg
305 310 315 320

Cys Glu Arg Thr Val Glu Arg Tyr Val Cys Lys
325 330



<210> 42 
<211> 237 
<212> PRT 
<213> Humanus 

<400> 42 

Met Arg Ala Arg Pro Gln Val Cys Glu Ala Leu Leu Phe Ala Leu Ala
1 5 10 15

Leu Gln Thr Gly Val Cys Tyr Gly Ile Lys Trp Leu Ala Leu Ser Lys
20 25 30

Thr Pro Ser Ala Leu Ala Leu Asn Gln Thr Gln His Cys Lys Gln Leu
35 40 45

Glu Gly Leu Val Ser Ala Gln Val Gln Leu Cys Arg Ser Asn Leu Glu
50 55 60

Leu Met His Thr Val Val His Ala Ala Arg Glu Val Met Lys Ala Cys
65 70 75 80

Arg Arg Ala Phe Ala Asp Met Arg Trp Asn Cys Ser Ser Ile Glu Leu
85 90 95

Ala Pro Asn Tyr Leu Leu Asp Leu Glu Arg Gly Thr Arg Glu Ser Ala
100 105 110

Phe Val Tyr Ala Ala Ala Asp Leu Lys Thr Arg Tyr Leu Ser Ala Thr
115 120 125

Lys Val Val His Arg Pro Met Gly Thr Arg Lys His Leu Val Pro Lys
130 135 140

Asp Leu Asp Ile Arg Pro Val Lys Asp Ser Glu Leu Val Tyr Leu Gln
145 150 155 160

Ser Ser Pro Asp Phe Cys Met Lys Asn Glu Lys Val Gly Ser His Gly
165 170 175

Thr Gln Asp Arg Gln Cys Asn Lys Thr Ser Asn Gly Ser Asp Ser Cys
180 185 190 

Asp Leu Met Cys Cys Gly Arg Gly Tyr Asn Pro Tyr Thr Asp Arg Val 
195 200 205 

Val Glu Arg Cys His Cys Lys Tyr His Trp Cys Cys Tyr Val Thr Cys 
210 215 220 

Arg Arg Cys Glu Arg Thr Val Glu Arg Tyr Val Cys Lys 
225 230 235 
<210> 43
<211> 615 
<212> PRT 
<213> Humanus 



<400> 43 

Met Ser Pro Phe Leu Arg Ile Gly Leu Ser Asn Phe Asp Cys Gly Ser
1 5 10 15

Cys Gln Ser Cys Gln Gly Glu Ala Val Asn Pro Tyr Cys Ala Val Leu
20 25 30

Val Lys Glu Tyr Val Glu Ser Glu Asn Gly Gln Met Tyr Ile Gln Lys
35 40 45

Lys Pro Thr Met Tyr Pro Pro Trp Asp Ser Thr Phe Asp Ala His Ile
50 55 60

Asn Lys Gly Arg Val Met Gln Ile Ile Val Lys Gly Lys Asn Val Asp
65 70 75 80

Leu Ile Ser Glu Thr Thr Val Glu Leu Tyr Ser Leu Ala Glu Arg Cys
85 90 95

Arg Lys Asn Asn Gly Lys Thr Glu Ile Trp Leu Glu Leu Lys Pro Gln
100 105 110

Gly Arg Met Leu Met Asn Ala Arg Tyr Phe Leu Glu Met Ser Asp Thr
115 120 125

Lys Asp Met Asn Glu Phe Glu Thr Glu Gly Phe Phe Ala Leu His Gln
130 135 140

Arg Arg Gly Ala Ile Lys Gln Ala Lys Val His His Val Lys Cys His
145 150 155 160

Glu Phe Thr Ala Thr Phe Phe Pro Gln Pro Thr Phe Cys Ser Val Cys
165 170 175

His Glu Phe Val Trp Gly Leu Asn Lys Gln Gly Tyr Gln Cys Arg Gln
180 185 190

Cys Asn Ala Ala Ile His Lys Lys Cys Ile Asp Lys Val Ile Ala Lys
195 200 205

Cys Thr Gly Ser Ala Ile Asn Ser Arg Glu Thr Met Phe His Lys Glu
210 215 220

Arg Phe Lys Ile Asp Met Pro His Arg Phe Lys Val Tyr Asn Tyr Lys
225 230 235 240

Ser Pro Thr Phe Cys Glu His Cys Gly Thr Leu Leu Trp Gly Leu Ala
245 250 255

Arg Gln Gly Leu Lys Cys Asp Ala Cys Gly Met Asn Val His His Arg
260 265 270

Cys Gln Thr Lys Val Ala Asn Leu Cys Gly Ile Asn Gln Lys Leu Met
275 280 285

Ala Glu Ala Leu Ala Met Ile Glu Ser Thr Gln Gln Ala Arg Cys Leu
290 295 300

Arg Asp Thr Glu Gln Ile Phe Arg Glu Gly Pro Val Glu Ile Gly Leu
305 310 315 320

Pro Cys Ser Ile Lys Asn Glu Ala Arg Pro Pro Cys Leu Pro Thr Pro
325 330 335

Gly Lys Arg Glu Pro Gln Gly Ile Ser Trp Glu Ser Pro Leu Asp Glu
340 345 350

Val Asp Lys Met Cys His Leu Pro Glu Pro Glu Leu Asn Lys Glu Arg
355 360 365

Pro Ser Leu Gln Ile Lys Leu Lys Ile Glu Asp Phe Ile Leu His Lys
370 375 380

Met Leu Gly Lys Gly Ser Phe Gly Lys Val Phe Leu Ala Glu Phe Lys
385 390 395 400

Lys Thr Asn Gln Phe Phe Ala Ile Lys Ala Leu Lys Lys Asp Val Val
405 410 415

Leu Met Asp Asp Asp Val Glu Cys Thr Met Val Glu Lys Arg Val Leu
420 425 430

Ser Leu Ala Trp Glu His Pro Phe Leu Thr His Met Phe Cys Thr Phe
435 440 445

Gln Thr Lys Glu Asn Leu Phe Phe Val Met Glu Tyr Leu Asn Gly Gly
450 455 460

Asp Leu Met Tyr His Ile Gln Ser Cys His Lys Phe Asp Leu Ser Arg
465 470 475 480

Ala Thr Phe Tyr Ala Ala Glu Ile Ile Leu Gly Leu Gln Phe Leu His
485 490 495

Ser Lys Gly Ile Val Tyr Arg Asp Leu Lys Leu Asp Asn Ile Leu Leu
500 505 510

Asp Lys Asp Gly His Ile Lys Ile Ala Asp Phe Gly Met Cys Lys Glu
515 520 525

Asn Met Leu Gly Asp Ala Lys Thr Asn Thr Phe Cys Gly Thr Pro Asp
530 535 540

Tyr Ile Ala Pro Glu Ile Leu Leu Gly Gln Lys Tyr Asn His Ser Val
545 550 555 560

Asp Trp Trp Ser Phe Gly Val Leu Leu Tyr Glu Met Leu Ile Gly Gln
565 570 575

Ser Pro Phe His Gly Gln Asp Glu Glu Glu Leu Phe His Ser Ile Arg
580 585 590

Met Asp Asn Pro Phe Tyr Pro Arg Trp Leu Glu Lys Glu Ala Lys Asp
595 600 605

Leu Leu Val Lys Val Arg Ser
610 615



<210> 44 
<211> 292 
<212> PRT 
<213> Humanus 

<400> 44 

Met Pro Ile Thr Arg Met Arg Met Arg Pro Trp Leu Glu Met Gln Ile
1 5 10 15

Asn Ser Asn Gln Ile Pro Gly Leu Ile Trp Ile Asn Lys Glu Glu Met
20 25 30

Ile Phe Gln Ile Pro Trp Lys His Ala Ala Lys His Gly Trp Asp Ile
35 40 45

Asn Lys Asp Ala Cys Leu Phe Arg Ser Trp Ala Ile His Thr Gly Arg
50 55 60

Tyr Lys Ala Gly Glu Lys Glu Pro Asp Pro Lys Thr Trp Lys Ala Asn
65 70 75 80

Phe Arg Cys Ala Met Asn Ser Leu Pro Asp Ile Glu Glu Val Lys Asp
85 90 95

Gln Ser Arg Asn Lys Gly Ser Ser Ala Val Arg Val Tyr Arg Met Leu
100 105 110

Pro Pro Leu Thr Lys Asn Gln Arg Lys Glu Arg Lys Ser Lys Ser Ser
115 120 125

Arg Asp Ala Lys Ser Lys Ala Lys Arg Lys Ser Cys Gly Asp Ser Ser
130 135 140

Pro Asp Thr Phe Ser Asp Gly Leu Ser Ser Ser Thr Leu Pro Asp Asp
145 150 155 160

His Ser Ser Tyr Thr Val Pro Gly Tyr Met Gln Asp Leu Glu Val Glu
165 170 175

Gln Ala Leu Thr Pro Ala Leu Ser Pro Cys Ala Val Ser Ser Thr Leu
180 185 190

Pro Asp Trp His Ile Pro Val Glu Val Val Pro Asp Ser Thr Ser Asp
195 200 205

Leu Tyr Asn Phe Gln Val Ser Pro Met Pro Ser Thr Ser Glu Ala Thr
210 215 220

Thr Asp Glu Asp Glu Glu Gly Lys Leu Pro Glu Asp Ile Met Lys Leu
225 230 235 240

Leu Glu Gln Ser Glu Trp Gln Pro Thr Asn Val Asp Gly Lys Gly Tyr
245 250 255

Leu Leu Asn Glu Pro Gly Val Gln Pro Thr Ser Val Tyr Gly Asp Phe
260 265 270

Ser Cys Lys Glu Glu Pro Glu Ile Asp Ser Pro Gly Gly Lys Lys Ala
275 280 285



Pro Gly Ser Leu 
290 



<210> 45 
<211> 702 
<212> PRT 
<213> Humanus 

<400> 45 

Met Trp Ser Trp Lys Cys Leu Leu Phe Trp Ala Val Leu Val Thr Ala
1 5 10 15

Thr Leu Cys Thr Ala Arg Pro Ser Pro Thr Leu Pro Glu Gln Ala Gln
20 25 30

Pro Trp Gly Ala Pro Val Glu Val Glu Ser Phe Leu Val His Pro Gly
35 40 45

Asp Leu Leu Gln Leu Arg Cys Arg Leu Arg Asp Asp Val Gln Ser Ile
50 55 60

Asn Trp Leu Arg Asp Gly Val Gln Leu Ala Glu Ser Asn Arg Thr Arg
65 70 75 80

Ile Thr Gly Glu Glu Val Glu Val Gln Asp Ser Val Pro Ala Asp Ser
85 90 95

Gly Leu Tyr Ala Cys Val Thr Ser Ser Pro Ser Gly Ser Asp Thr Thr
100 105 110

Tyr Phe Ser Val Asn Val Ser Asp Ala Leu Pro Ser Ser Glu Asp Asp
115 120 125

Asp Asp Asp Asp Asp Ser Ser Ser Glu Glu Lys Glu Thr Asp Asn Thr
130 135 140

Lys Pro Asn Arg Met Pro Val Ala Pro Tyr Trp Thr Ser Pro Glu Lys
145 150 155 160

Met Glu Lys Lys Leu His Ala Val Pro Ala Ala Lys Thr Val Lys Phe
165 170 175

Lys Cys Pro Ser Ser Gly Thr Pro Asn Pro Thr Leu Arg Trp Leu Lys
180 185 190

Asn Gly Lys Glu Phe Lys Pro Asp His Arg Ile Gly Gly Tyr Lys Val
195 200 205

Arg Tyr Ala Thr Trp Ser Ile Ile Met Asp Ser Val Val Pro Ser Asp
210 215 220

Lys Gly Asn Tyr Thr Cys Ile Val Glu Asn Glu Tyr Gly Ser Ile Asn
225 230 235 240

His Thr Tyr Gln Leu Asp Val Val Glu Arg Ser Pro His Arg Pro Ile
245 250 255

Leu Gln Ala Gly Leu Pro Ala Asn Lys Thr Val Ala Leu Gly Ser Asn
260 265 270

Val Glu Phe Met Cys Lys Val Tyr Ser Asp Pro Gln Pro His Ile Gln
275 280 285

Trp Leu Lys His Ile Glu Val Asn Gly Ser Lys Ile Gly Pro Asp Asn
290 295 300

Leu Pro Tyr Val Gln Ile Leu Lys Thr Ala Gly Val Asn Thr Thr Asp
305 310 315 320

Lys Glu Met Glu Val Leu His Leu Arg Asn Val Ser Phe Glu Asp Ala
325 330 335

Gly Glu Tyr Thr Cys Leu Ala Gly Asn Ser Ile Gly Leu Ser His His
340 345 350

Ser Ala Trp Leu Thr Val Leu Glu Ala Leu Glu Glu Arg Pro Ala Val
355 360 365

Met Thr Ser Pro Leu Tyr Leu Glu Ile Ile Ile Tyr Cys Thr Gly Ala
370 375 380

Phe Leu Ile Ser Cys Met Val Gly Ser Val Ile Val Tyr Lys Met Lys
385 390 395 400

Ser Gly Thr Lys Lys Ser Asp Phe His Ser Gln Met Ala Val His Lys
405 410 415

Leu Ala Lys Ser Ile Pro Leu Arg Arg Gln Val Thr Val Ser Ala Asp
420 425 430

Ser Ser Ala Ser Met Asn Ser Gly Val Leu Leu Val Arg Pro Ser Arg
435 440 445

Leu Ser Ser Ser Gly Thr Pro Met Leu Ala Gly Val Ser Glu Tyr Glu
450 455 460

Leu Pro Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu Val Leu
465 470 475 480

Gly Lys Pro Leu Gly Glu Gly Cys Phe Gly Gln Val Val Leu Ala Glu
485 490 495

Ala Ile Gly Leu Asp Lys Asp Lys Pro Asn Arg Val Thr Lys Val Ala
500 505 510

Val Lys Met Leu Lys Ser Asp Ala Thr Glu Lys Asp Leu Ser Asp Leu
515 520 525

Ile Ser Glu Met Glu Met Met Lys Met Ile Gly Lys His Lys Asn Ile
530 535 540

Ile Asn Leu Leu Gly Ala Cys Thr Gln Asp Gly Pro Leu Tyr Val Ile
545 550 555 560

Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu Gln Ala Arg
565 570 575

Arg Pro Pro Gly Leu Glu Tyr Cys Tyr Asn Pro Ser His Asn Pro Glu
580 585 590

Glu Gln Leu Ser Ser Lys Asp Leu Val Ser Cys Ala Tyr Gln Val Ala
595 600 605

Arg Gly Met Glu Tyr Leu Ala Ser Lys Lys Cys Ile His Arg Asp Leu
610 615 620

Ala Ala Arg Asn Val Leu Val Thr Glu Asp Asn Val Met Lys Ile Ala
625 630 635 640

Asp Phe Gly Leu Ala Arg Asp Ile His His Ile Asp Tyr Tyr Lys Lys
645 650 655

Thr Thr Asn Gly Arg Leu Pro Val Lys Trp Met Ala Pro Glu Ala Leu
660 665 670

Phe Asp Arg Ile Tyr Thr His Gln Ser Asp Val Trp Ser Phe Gly Val
675 680 685

Pro His Thr Pro Val Cys Leu Trp Arg Asn Phe Ser Ser Cys
690 695 700



<210> 46 
<211> 295 
<212> PRT 
<213> Humanus 

<400> 46 
Met Pro Lys Arg Gly Lys Lys Gly Ala Val Ala Glu Asp Gly Asp Glu
1 5 10 15

Leu Arg Thr Glu Pro Glu Ala Lys Lys Ser Lys Thr Ala Ala Lys Lys
20 25 30

Asn Asp Lys Glu Ala Ala Gly Glu Gly Pro Ala Leu Tyr Glu Asp Pro
35 40 45

Pro Asp Gln Lys Thr Ser Pro Ser Gly Lys Pro Ala Thr Leu Lys Ile
50 55 60

Cys Ser Trp Asn Val Asp Gly Leu Arg Ala Trp Ile Lys Lys Lys Gly
65 70 75 80

Leu Asp Trp Val Lys Glu Glu Ala Pro Asp Ile Leu Cys Leu Gln Glu
85 90 95

Thr Lys Cys Ser Glu Asn Lys Leu Pro Ala Glu Leu Gln Glu Leu Pro
100 105 110

Gly Leu Ser His Gln Tyr Trp Ser Ala Pro Ser Asp Lys Glu Gly Tyr
115 120 125

Ser Gly Val Gly Leu Leu Ser Arg Gln Cys Pro Leu Lys Val Ser Tyr
130 135 140

Gly Ile Ala Tyr Val Pro Asn Ala Gly Arg Gly Leu Val Arg Leu Glu
145 150 155 160

Tyr Arg Gln Arg Trp Asp Glu Ala Phe Arg Lys Phe Leu Lys Gly Leu
165 170 175

Ala Ser Arg Lys Pro Leu Val Leu Cys Gly Asp Leu Asn Val Ala His
180 185 190

Glu Glu Ile Asp Leu Arg Asn Pro Lys Gly Asn Lys Lys Asn Ala Gly
195 200 205

Phe Thr Pro Gln Glu Arg Gln Gly Phe Gly Glu Leu Leu Gln Ala Val
210 215 220

Pro Leu Ala Asp Ser Phe Arg His Leu Tyr Pro Asn Thr Pro Tyr Ala
225 230 235 240

Tyr Thr Phe Trp Thr Tyr Met Met Asn Ala Arg Ser Lys Asn Val Gly
245 250 255

Trp Arg Leu Asp Tyr Phe Leu Leu Ser His Ser Leu Leu Pro Ala Leu
260 265 270

Cys Asp Ser Lys Ile Arg Ser Lys Ala Leu Gly Ser Asp His Cys Pro
275 280 285

Ile Thr Leu Tyr Leu Ala Leu
290 295



<210> 47 
<211> 342 
<212> PRT 
<213> Humanus 

<400> 47

Met Pro Lys Arg Gly Lys Lys Gly Ala Val Ala Glu Asp Gly Asp Glu
1 5 10 15

Leu Arg Thr Gly Lys Gly Met Lys Ser Ala Leu Leu Pro Arg Asn Cys
20 25 30

Gly Gly Gly Val Cys His Ser Leu Asp Val Arg Glu Pro Glu Ala Lys
35 40 45

Lys Ser Lys Thr Ala Ala Lys Lys Asn Asp Lys Glu Ala Ala Gly Glu
50 55 60

Gly Pro Ala Leu Tyr Glu Asp Pro Pro Asp Gln Lys Thr Ser Pro Ser
65 70 75 80

Gly Lys Pro Ala Thr Leu Lys Ile Cys Ser Trp Asn Val Asp Gly Leu
85 90 95

Arg Ala Trp Ile Lys Lys Lys Gly Leu Asp Trp Val Lys Glu Glu Ala
100 105 110

Pro Asp Ile Leu Cys Leu Gln Glu Thr Lys Cys Ser Glu Asn Lys Leu
115 120 125

Pro Ala Glu Leu Gln Glu Leu Pro Gly Leu Ser His Gln Tyr Trp Ser
130 135 140

Ala Pro Ser Asp Lys Glu Gly Tyr Ser Gly Val Gly Leu Leu Ser Arg
145 150 155 160

Gln Cys Pro Leu Lys Val Ser Tyr Gly Ile Gly Asp Glu Glu His Asp
165 170 175

Gln Glu Gly Arg Val Ile Val Ala Glu Phe Asp Ser Phe Val Leu Val
180 185 190

Thr Ala Tyr Val Pro Asn Ala Gly Arg Gly Leu Val Arg Leu Glu Tyr
195 200 205

Arg Gln Arg Trp Asp Glu Ala Phe Arg Lys Phe Leu Lys Gly Leu Ala
210 215 220

Ser Arg Lys Pro Leu Val Leu Cys Gly Asp Leu Asn Val Ala His Glu
225 230 235 240

Glu Ile Asp Leu Arg Asn Pro Lys Gly Asn Lys Lys Asn Ala Gly Phe
245 250 255

Thr Pro Gln Glu Arg Gln Gly Phe Gly Glu Leu Leu Gln Ala Val Pro
260 265 270

Leu Ala Asp Ser Phe Arg His Leu Tyr Pro Asn Thr Pro Tyr Ala Tyr
275 280 285

Thr Phe Trp Thr Tyr Met Met Asn Ala Arg Ser Lys Asn Val Gly Trp
290 295 300

Arg Leu Asp Tyr Phe Leu Leu Ser His Ser Leu Leu Pro Ala Leu Cys
305 310 315 320

Asp Ser Lys Ile Arg Ser Lys Ala Leu Gly Ser Asp His Cys Pro Ile
325 330 335

Thr Leu Tyr Leu Ala Leu
340



<210> 48 
<211> 305 
<212> PRT 
<213> Humanus 

<400> 48 

Met Phe Gln Ala Ala Glu Arg Pro Gln Glu Trp Ala Met Glu Gly Pro
1 5 10 15

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser
20 25 30

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gln Met Val Lys Glu
35 40 45

Leu Gln Glu Ile Arg Leu Glu Pro Gln Glu Val Pro Arg Gly Ser Glu
50 55 60

Pro Trp Lys Gln Gln Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu
65 70 75 80

Ala Ile Ile His Glu Glu Lys Ala Leu Thr Met Glu Val Ile Arg Gln
85 90 95

Val Lys Gly Asp Leu Ala Phe Leu Asn Phe Gln Asn Asn Leu Gln Gln
100 105 110

Thr Pro Leu His Leu Ala Val Ile Thr Asn Gln Pro Glu Ile Ala Glu
115 120 125

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly
130 135 140

Asn Thr Pro Leu His Leu Ala Cys Glu Gln Gly Cys Leu Ala Ser Val
145 150 155 160

Gly Val Leu Thr Gln Ser Cys Thr Thr Pro His Leu His Ser Ile Leu
165 170 175

Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser Ile
180 185 190

His Gly Tyr Leu Gly Ile Val Glu Leu Leu Val Ser Leu Gly Ala Asp
195 200 205

Val Asn Ala Gln Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala
210 215 220

Val Asp Leu Gln Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly
225 230 235 240

Ala Asp Val Asn Arg Val Thr Tyr Gln Gly Tyr Ser Pro Tyr Gln Leu
245 250 255

Thr Trp Gly Arg Pro Ser Thr Arg Ile Gln Gln Gln Leu Gly Gln Leu
260 265 270

Thr Leu Glu Asn Leu Gln Met Leu Pro Glu Ser Glu Asp Glu Glu Ser
275 280 285

Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Val Ser 
290 295 300 

Leu 
305 



<210> 49 
<211> 289 
<212> PRT 
<213> Humanus 

<400> 49 

Met Phe Gln Ala Ala Glu Arg Pro Gln Glu Trp Ala Met Glu Gly Pro
1 5 10 15

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser
20 25 30

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gln Met Val Lys Glu
35 40 45

Leu Gln Glu Ile Arg Leu Glu Pro Gln Glu Val Pro Arg Gly Ser Glu
50 55 60

Pro Trp Lys Gln Gln Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu
65 70 75 80

Ala Ile Ile His Glu Glu Lys Ala Leu Thr Met Glu Val Ile Arg Gln
85 90 95

Val Lys Gly Asp Leu Ala Phe Leu Asn Phe Gln Asn Asn Leu Gln Gln
100 105 110

Thr Pro Leu His Leu Ala Val Ile Thr Asn Gln Pro Glu Ile Ala Glu
115 120 125

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly
130 135 140

Asn Thr Pro Leu His Leu Ala Cys Glu Gln Gly Cys Leu Ala Ser Val
145 150 155 160

Gly Val Leu Thr Gln Ser Cys Thr Thr Pro His Leu His Ser Ile Leu
165 170 175

Lys Ala Thr Asn Tyr Asn Gly Gln Glu Pro Cys Asn Gly Arg Thr Ala
180 185 190

Leu His Leu Ala Val Asp Leu Gln Asn Pro Asp Leu Val Ser Leu Leu
195 200 205

Leu Lys Cys Gly Ala Asp Val Asn Arg Val Thr Tyr Gln Gly Tyr Ser
210 215 220

Pro Tyr Gln Leu Thr Trp Gly Arg Pro Ser Thr Arg Ile Gln Gln Gln
225 230 235 240

Leu Gly Gln Leu Thr Leu Glu Asn Leu Gln Met Leu Pro Glu Ser Glu
245 250 255

Asp Glu Glu Ser Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu
260 265 270

Asp Glu Leu Pro Tyr Asp Asp Cys Val Phe Gly Gly Gln Arg Leu Thr
275 280 285

Leu 



<210> 50 
<211> 921 
<212> PRT 
<213> Humanus 

<400> 50 

Met Ala Gly Ile Phe Tyr Phe Ala Leu Phe Ser Cys Leu Phe Gly Ile
1 5 10 15

Cys Asp Ala Val Thr Gly Ser Arg Val Tyr Pro Ala Asn Glu Val Thr
20 25 30

Leu Leu Asp Ser Arg Ser Val Gln Gly Glu Leu Gly Trp Ile Ala Ser
35 40 45

Pro Leu Glu Gly Gly Trp Glu Glu Val Ser Ile Met Asp Glu Lys Asn
50 55 60

Thr Pro Ile Arg Thr Tyr Gln Val Cys Asn Val Met Glu Pro Ser Gln
65 70 75 80

Asn Asn Trp Leu Arg Thr Asp Trp Ile Thr Arg Glu Gly Ala Gln Arg
85 90 95

Val Tyr Ile Glu Ile Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu Pro
100 105 110

Gly Val Met Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr Glu
115 120 125

Ser Asp Asn Asp Lys Glu Arg Phe Ile Arg Glu Asn Gln Phe Val Lys
130 135 140

Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr Gln Val Asp Ile Gly
145 150 155 160

Asp Arg Ile Met Lys Leu Asn Thr Glu Ile Arg Asp Val Gly Pro Leu
165 170 175

Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys Ile
180 185 190

Ala Leu Val Ser Val Arg Val Phe Tyr Lys Lys Cys Pro Leu Thr Val
195 200 205

Arg Asn Leu Ala Gln Phe Pro Asp Thr Ile Thr Gly Ala Asp Thr Ser
210 215 220

Ser Leu Val Glu Val Arg Gly Ser Cys Val Asn Asn Ser Glu Glu Lys
225 230 235 240

Asp Val Pro Lys Met Tyr Cys Gly Ala Asp Gly Glu Trp Leu Val Pro
245 250 255

Ile Gly Asn Cys Leu Cys Asn Ala Gly His Glu Glu Arg Ser Gly Glu
260 265 270

Cys Gln Ala Cys Lys Ile Gly Tyr Tyr Lys Ala Leu Ser Thr Asp Ala
275 280 285

Thr Cys Ala Lys Cys Pro Pro His Ser Tyr Ser Val Trp Glu Gly Ala
290 295 300

Thr Ser Cys Thr Cys Asp Arg Gly Phe Phe Arg Ala Asp Asn Asp Ala
305 310 315 320

Ala Ser Met Pro Cys Thr Arg Pro Pro Ser Ala Pro Leu Asn Leu Ile
325 330 335

Ser Asn Val Asn Glu Thr Ser Val Asn Leu Glu Trp Ser Ser Pro Gln
340 345 350

Asn Thr Gly Gly Arg Gln Asp Ile Ser Tyr Asn Val Val Cys Lys Lys
355 360 365

Cys Gly Ala Gly Asp Pro Ser Lys Cys Arg Pro Cys Gly Ser Gly Val
370 375 380

His Tyr Thr Pro Gln Gln Asn Gly Leu Lys Thr Thr Lys Val Ser Ile
385 390 395 400

Thr Asp Leu Leu Ala His Thr Asn Tyr Thr Phe Glu Ile Trp Ala Val
405 410 415

Asn Gly Val Ser Lys Tyr Asn Pro Asn Pro Asp Gln Ser Val Ser Val
420 425 430

Thr Val Thr Thr Asn Gln Ala Ala Pro Ser Ser Ile Ala Leu Val Gln
435 440 445

Ala Lys Glu Val Thr Arg Tyr Ser Val Ala Leu Ala Trp Leu Glu Pro
450 455 460

Asp Arg Pro Asn Gly Val Ile Leu Glu Tyr Glu Val Lys Tyr Tyr Glu
465 470 475 480

Lys Asp Gln Asn Glu Arg Ser Tyr Arg Ile Val Arg Thr Ala Ala Arg
485 490 495

Asn Thr Asp Ile Lys Gly Leu Asn Pro Leu Thr Ser Tyr Val Phe His
500 505 510

Val Arg Ala Arg Thr Ala Ala Gly Tyr Gly Asp Phe Ser Glu Pro Leu
515 520 525

Glu Val Thr Thr Asn Thr Val Pro Ser Arg Ile Ile Gly Asp Gly Ala
530 535 540

Asn Ser Thr Val Leu Leu Val Ser Val Ser Gly Ser Val Val Leu Val
545 550 555 560

Val Ile Leu Ile Ala Ala Phe Val Ile Ser Arg Arg Arg Ser Lys Tyr
565 570 575

Ser Lys Ala Lys Gln Glu Ala Asp Glu Glu Lys His Leu Asn Gln Gly
580 585 590

Val Arg Thr Tyr Val Asp Pro Phe Thr Tyr Glu Asp Pro Asn Gln Ala
595 600 605

Val Arg Glu Phe Ala Lys Glu Ile Asp Ala Ser Cys Ile Lys Ile Glu
610 615 620

Lys Val Ile Gly Val Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu
625 630 635 640

Lys Val Pro Gly Lys Arg Glu Ile Cys Val Ala Ile Lys Thr Leu Lys
645 650 655

Ala Gly Tyr Thr Asp Lys Gln Arg Arg Asp Phe Leu Ser Glu Ala Ser
660 665 670

Ile Met Gly Gln Phe Asp His Pro Asn Ile Ile His Leu Glu Gly Val
675 680 685

Val Thr Lys Cys Lys Pro Val Met Ile Ile Thr Glu Tyr Met Glu Asn
690 695 700

Gly Ser Leu Asp Ala Phe Leu Arg Lys Asn Asp Gly Arg Phe Thr Val
705 710 715 720

Ile Gln Leu Val Gly Met Leu Arg Gly Ile Gly Ser Gly Met Lys Tyr
725 730 735

Leu Ser Asp Met Ser Tyr Val His Arg Asp Leu Ala Ala Arg Asn Ile
740 745 750

Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Met Ser
755 760 765

Arg Val Leu Glu Asp Asp Pro Glu Ala Ala Tyr Thr Thr Arg Gly Gly
770 775 780

Lys Ile Pro Ile Arg Trp Thr Ala Pro Glu Ala Ile Ala Tyr Arg Lys
785 790 795 800

Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly Ile Val Met Trp Glu
805 810 815

Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gln Asp
820 825 830

Pro Asn Thr Ala Leu Leu Asp Pro Ser Ser Pro Glu Phe Ser Ala Val
835 840 845

Val Ser Val Gly Asp Trp Leu Gln Ala Ile Lys Met Asp Arg Tyr Lys
850 855 860

Asp Asn Phe Thr Ala Ala Gly Tyr Thr Thr Leu Glu Ala Val Val His
865 870 875 880

Val Asn Gln Glu Asp Leu Ala Arg Ile Gly Ile Thr Ala Ile Thr His
885 890 895

Gln Asn Lys Ile Leu Ser Ser Val Gln Ala Met Arg Thr Gln Met Gln
900 905 910

Gln Met His Gly Arg Met Val Pro Val
915 920



<210> 51 
<211> 444 
<212> PRT 
<213> Humanus

<400> 51 

Met Asn Asp Phe Gly Ile Lys Asn Met Asp Gln Val Ala Pro Val Ala
1 5 10 15
Asn Ser Tyr Arg Gly Thr Leu Lys Arg Gln Pro Ala Phe Asp Thr Phe
20 25 30 

Asp Gly Ser Leu Phe Ala Val Phe Pro Ser Leu Asn Glu Glu Gln Thr
35 40 45

Leu Gln Glu Val Pro Thr Gly Leu Asp Ser Ile Ser His Asp Ser Ala
50 55 60

Asn Cys Glu Leu Pro Leu Leu Thr Pro Cys Ser Lys Ala Val Met Ser
65 70 75 80

Gln Ala Leu Lys Ala Thr Phe Ser Gly Phe Phe Trp Ala Thr Asn Glu
85 90 95

Phe Ser Leu Val Asn Val Asn Leu Gln Arg Phe Gly Met Asn Gly Gln
100 105 110

Met Leu Cys Asn Leu Gly Lys Glu Arg Phe Leu Glu Leu Ala Pro Asp
115 120 125

Phe Val Gly Asp Ile Leu Trp Glu His Leu Glu Gln Met Ile Lys Glu
130 135 140

Asn Gln Glu Lys Thr Glu Asp Gln Tyr Glu Glu Asn Ser His Leu Thr
145 150 155 160

Ser Val Pro His Trp Ile Asn Ser Asn Thr Leu Gly Phe Gly Thr Glu
165 170 175

Gln Ala Pro Tyr Gly Met Gln Thr Gln Asn Tyr Pro Lys Gly Gly Leu
180 185 190

Leu Asp Ser Met Cys Pro Ala Ser Thr Pro Ser Val Leu Ser Ser Glu
195 200 205

Gln Glu Phe Gln Met Phe Pro Lys Ser Arg Leu Ser Ser Val Ser Val
210 215 220

Thr Tyr Cys Ser Val Ser Gln Asp Phe Pro Gly Ser Asn Leu Asn Leu
225 230 235 240

Leu Thr Asn Asn Ser Gly Thr Pro Lys Asp His Asp Ser Pro Glu Asn
245 250 255

Gly Ala Asp Ser Phe Glu Ser Ser Asp Ser Leu Leu Gln Ser Trp Asn
260 265 270

Ser Gln Ser Ser Leu Leu Asp Val Gln Arg Val Pro Ser Phe Glu Ser
275 280 285

Phe Glu Asp Asp Cys Ser Gln Ser Leu Cys Leu Asn Lys Pro Thr Met
290 295 300

Ser Phe Lys Asp Tyr Ile Gln Glu Arg Ser Asp Pro Val Glu Gln Gly
305 310 315 320

Lys Pro Val Ile Pro Ala Ala Val Leu Ala Gly Phe Thr Gly Ser Gly
325 330 335

Pro Ile Gln Leu Trp Gln Phe Leu Leu Glu Leu Leu Ser Asp Lys Ser
340 345 350

Cys Gln Ser Phe Ile Ser Trp Thr Gly Asp Gly Trp Glu Phe Lys Leu
355 360 365

Ala Asp Pro Asp Glu Val Ala Arg Arg Trp Gly Lys Arg Lys Asn Lys
370 375 380

Pro Lys Met Asn Tyr Glu Lys Leu Ser Arg Gly Leu Arg Tyr Tyr Tyr
385 390 395 400

Asp Lys Asn Ile Ile His Lys Thr Ser Gly Lys Arg Tyr Val Tyr Arg
405 410 415

Phe Val Cys Asp Leu Gln Asn Leu Leu Gly Phe Thr Pro Glu Glu Leu
420 425 430

His Ala Ile Leu Gly Val Gln Pro Asp Thr Glu Asp
435 440



<210> 52 
<211> 260 
<212> PRT 
<213> Humanus 

<400> 52 

Met Ala Gly Ser Ala Met Ser Ser Lys Phe Phe Leu Val Ala Leu Ala
1 5 10 15

Ile Phe Phe Ser Phe Ala Gln Val Val Ile Glu Ala Asn Ser Trp Trp
20 25 30

Ser Leu Gly Met Asn Asn Pro Val Gln Met Ser Glu Val Tyr Ile Ile
35 40 45

Gly Ala Gln Pro Leu Cys Ser Gln Leu Ala Gly Leu Ser Gln Gly Gln
50 55 60

Lys Lys Leu Cys His Leu Tyr Gln Asp His Met Gln Tyr Ile Gly Glu
65 70 75 80

Gly Ala Lys Thr Gly Ile Lys Glu Cys Gln Tyr Gln Phe Arg His Arg
85 90 95

Arg Trp Asn Cys Ser Thr Val Asp Asn Thr Ser Val Phe Gly Arg Val
100 105 110

Met Gln Ile Gly Ser Arg Glu Thr Ala Phe Thr Tyr Ala Val Ser Ala
115 120 125

Ala Gly Val Val Asn Ala Met Ser Arg Ala Cys Arg Glu Gly Glu Leu
130 135 140

Ser Thr Cys Gly Cys Ser Arg Ala Ala Arg Pro Lys Asp Leu Pro Arg
145 150 155 160

Asp Trp Leu Trp Gly Gly Cys Gly Asp Asn Ile Asp Tyr Gly Tyr Arg
165 170 175

Phe Ala Lys Glu Phe Val Asp Ala Arg Glu Arg Glu Arg Ile His Ala
180 185 190

Lys Gly Ser Tyr Glu Ser Ala Arg Ile Leu Met Asn Leu His Asn Asn
195 200 205

Glu Ala Gly Arg Arg Thr Val Tyr Asn Leu Ala Asp Val Ala Cys Lys
210 215 220

Cys His Gly Val Ser Gly Ser Cys Ser Leu Lys Thr Cys Trp Leu Gln
225 230 235 240

Leu Ala Asp Phe Arg Lys Val Gly Asp Ala Leu Lys Glu Lys Tyr Asp 
245 250 255 

Thr Leu Val Gly 
260 



<210> 53 
<211> 719 
<212> PRT 
<213> Humanus 

<400> 53 

Met Ala Leu Arg Arg Ser Met Gly Arg Pro Gly Leu Pro Pro Leu Pro 
1 5 10 15 

Leu Pro Pro Pro Pro Arg Leu Gly Leu Leu Leu Ala Glu Ser Ala Ala 
20 25 30 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr Val Ser Gln
35 40 45

Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly Met Glu Glu Pro
50 55 60

Asp Ile Gln Trp Val Lys Asp Gly Ala Val Val Gln Asn Leu Asp Gln
65 70 75 80

Leu Tyr Ile Pro Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu
85 90 95

Lys Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln Val Glu
100 105 110

Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val Trp Leu Thr Val Glu
115 120 125

Gly Val Pro Phe Phe Thr Val Glu Pro Lys Asp Leu Ala Val Pro Pro
130 135 140

Asn Ala Pro Phe Gln Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro
145 150 155 160

Val Thr Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro Ala
165 170 175

Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Ser Thr Met
180 185 190

Phe Ser Cys Glu Ala His Asn Leu Lys Gly Leu Ala Ser Ser Arg Thr
195 200 205

Ala Thr Val His Leu Gln Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr
210 215 220

Val Thr Lys Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly
225 230 235 240

Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln Val Thr Gln
245 250 255

Ala Pro Gly Gly Trp Glu Val Leu Ala Val Val Val Pro Val Pro Pro
260 265 270

Phe Thr Cys Leu Leu Arg Asp Leu Val Pro Ala Thr Asn Tyr Ser Leu
275 280 285

Arg Val Arg Cys Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp
290 295 300

Val Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro Gln Asn
305 310 315 320

Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu Trp Glu Glu
325 330 335

Val Ile Pro Glu Ala Pro Leu Glu Gly Pro Leu Gly Pro Tyr Lys Leu
340 345 350

Ser Trp Val Gln Asp Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly
355 360 365

Thr Arg Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile Val
370 375 380

Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro Trp Ser Gln Pro
385 390 395 400

Leu Val Val Ser Ser His Asp Arg Ala Gly Gln Gln Gly Pro Pro His
405 410 415

Ser Arg Thr Ser Trp Val Pro Val Val Leu Gly Val Leu Thr Ala Leu
420 425 430

Val Thr Ala Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys
435 440 445

Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala Arg Gly Glu
450 455 460

Pro Ala Val His Phe Arg Ala Ala Arg Ser Phe Asn Arg Glu Arg Pro
465 470 475 480

Glu Arg Ile Glu Ala Thr Leu Asp Ser Leu Gly Ile Ser Asp Glu Leu
485 490 495

Lys Glu Lys Leu Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu
500 505 510

Gly Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gln
515 520 525

Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val Ala Val Lys Met Leu
530 535 540

Lys Ala Asp Ile Ile Ala Ser Ser Asp Ile Glu Glu Phe Leu Arg Glu
545 550 555 560

Lys Glu Phe Asp His Pro His Val Ala Lys Leu
565 570 575 



Ala 



Gly Val Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro Met
580 585 590

Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe Leu Leu
595 600 605

Ala Ser Arg Ile Gly Glu Asn Pro Phe Asn Leu Pro Leu Gln Thr Leu
610 615 620

Ile Arg Phe Met Val Asp Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser
625 630 635 640

Arg Asn Phe Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala
645 650 655

Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser Arg Lys Ile

„ rt fifiS 



Pro 



660 665 

Tyr Ser Asp Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln
675 680 685

Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn Glu Thr Gln
690 695 700

Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu Pro His Ser Ser Cys
705 710 715



<210> 54 
<211> 848 
<212> PRT 
<213> Humanus 

<400> 54

Met Cys Arg Ile Ala Gly Ala Leu Arg Thr Leu Leu Pro Leu Leu Ala
1 5 10 15

Ala Leu Leu Gln Ala Ser Val Glu Ala Ser Gly Glu Ile Ala Leu Cys
20 25 30

Lys Thr Gly Phe Pro Glu Asp Val Tyr Ser Ala Val Leu Ser Lys Asp
35 40 45

Val His Glu Gly Gln Pro Leu Leu Asn Val Lys Phe Ser Asn Cys Asn
50 55 60

Gly Lys Arg Lys Val Gln Tyr Glu Ser Ser Glu Pro Ala Asp Phe Lys
65 70 75 80

Val Asp Glu Asp Gly Met Val Tyr Ala Val Arg Ser Phe Pro Leu Ser
85 90 95

Ser Glu His Ala Lys Phe Leu Ile Tyr Ala Gln Asp Lys Glu Thr Gln
100 105 110

Glu Lys Trp Gln Val Ala Val Lys Leu Ser Leu Lys Pro Thr Leu Thr
115 120 125

Glu Glu Ser Val Lys Glu Ser Ala Glu Val Glu Glu Ile Val Phe Pro
130 135 140

Arg Gln Phe Ser Lys His Ser Gly His Leu Gln Arg Gln Lys Arg Asp
145 150 155 160

Trp Val Ile Pro Pro Ile Asn Leu Pro Glu Asn Ser Arg Gly Pro Phe
165 170 175

Pro Gln Glu Leu Val Arg Ile Arg Ser Asp Arg Asp Lys Asn Leu Ser
180 185 190

Leu Arg Tyr Ser Val Thr Gly Pro Gly Ala Asp Gln Pro Pro Thr Gly
195 200 205

Ile Phe Ile Ile Asn Pro Ile Ser Gly Gln Leu Ser Val Thr Lys Pro
210 215 220

Leu Asp Arg Glu Gln Ile Ala Arg Phe His Leu Arg Ala His Ala Val
225 230 235 240

Asp Ile Asn Gly Asn Gln Val Glu Asn Pro Ile Asp Ile Val Ile Asn
245 250 255

Val Ile Asp Met Asn Asp Asn Arg Pro Glu Phe Leu His Gln Val Trp
260 265 270

Asn Gly Thr Val Pro Glu Gly Ser Lys Pro Gly Thr Tyr Val Met Thr
275 280 285

Val Thr Ala Ile Asp Ala Asp Asp Pro Asn Ala Leu Asn Gly Met Leu
290 295 300

Arg Tyr Arg Ile Val Ser Gln Ala Pro Ser Thr Pro Ser Pro Asn Met
305 310 315 320

Phe Thr Ile Asn Asn Glu Thr Gly Asp Ile Ile Thr Val Ala Ala Gly
325 330 335

Leu Asp Arg Glu Lys Val Gln Gln Tyr Thr Leu Ile Ile Gln Ala Thr
340 345 350

Asp Met Glu Gly Asn Pro Thr Tyr Gly Leu Ser Asn Thr Ala Thr Ala
355 360 365

Val Ile Thr Val Thr Asp Val Asn Asp Asn Pro Pro Glu Phe Thr Ala
370 375 380

Met Thr Phe Tyr Gly Glu Val Pro Glu Asn Arg Val Asp Ile Ile Val
385 390 395 400






r>m Hi <? Thr Pro Ala Trp 

Ala Asn Leu th. Val Th. «P «P *S 

405 

ai. vi «, «. .« «v «p - - "S - 

Il8 G m ^ Z «» - «■ «p - - v.i v. ^ 

435 

Met Phe Val Leu Thr Val Ala Ala 

455 ' 

He Gin His Pro Pro Gin Ser 



Pro lie Asp Phe Glu Thr Asn Arg nee rnc ^ 



470 

465 

Thr Ala Thr Val Ser vax - q 

485 



Phe Ala Pro Asn Pro Ly 

500 



, » rm Val Pro Leu Ala Lys Gly He Gin 48Q 

Glu Asn Gin vax " u ^ 475 

V,! S«Val The Val Xl.»pV»l AsnGlU Asn .» „. 

le Jle M , Gin Glu Glu, Gly L.u Hi, Al. 

505 3i 

, phe xhr Ala Gin Asp Pro Asp Arg Tyr Met 
Gly Thr Met Leu Thr Thr Phe Thr AX ^ 

Gin Gin Z XL ^ ^ f/ 5 - - ~ £ " 

Ly , Hi ~P - -1 CI, Gin XL - £ XL - -I «P 

* val Lvs Asn Asn He Tyr Asn Ala Thr Phe Leu 
Arg Glu Ser Pro Asn Val Lys Asn ^ 575 

Ala ,.r *P «n ^ ,1. »~ Pro « - ^ - Glv - - - 

Ile Tyc W u Z «P XI. «» «P «» — Gin Val l,u «. - 

595 

«.v. Pr„ Asd Pro Asn Ser He Asn He Thr 
Glu Ala Glu Thr Cys Glu Thr Pro Asp ^ 

n. ASD He Asp Pro Asn Ala Gly Pro Phe Ala Phe Asp 
Ala Leu Asp Tyr Asp lie Asp g35 640 

625 

Leu M «. ~ - -x - - - -j - - - - ss - 

645 

*n rin Leu Asn Leu Lys He Lys Phe Leu Glu 
Leu Asn Gly Asp Phe Ala Gin Leu Asn 67Q 
660 

« Tip Tie He Thr Asp Ser Gly Asn Pro 
Ala Gly He Tyr Glu Val Pro He g85 

675 

Val Lys Val Cys Gin Cys Asp 

Pro Lys Ser Asn lie «i ^- 



Tlo c er He Leu Arg Val Lys Val 
Asn He ser xxe u * 70Q 

... r=n Gl y «P C The Tp Val »p «, XI. Val Gly - «X» «■ 



705 






Gly Thr Gly Ala Ile Ile Ala Ile Leu Leu Cys Ile Ile Ile Leu Leu
725 730 735

Ile Leu Val Leu Met Phe Val Val Trp Met Lys Arg Arg Asp Lys Glu
740 745 750

Arg Gln Ala Lys Gln Leu Leu Ile Asp Pro Glu Asp Asp Val Arg Asp
755 760 765

Asn Ile Leu Lys Tyr Asp Glu Glu Gly Gly Gly Glu Glu Asp Gln Asp
770 775 780

Tyr Asp Leu Ser Gln Leu Gln Gln Pro Asp Thr Val Glu Pro Asp Ala
785 790 795 800

Ile Lys Pro Val Gly Ile Arg Arg Met Asp Glu Arg Pro Ile His Ala
805 810 815

Glu Pro Gln Tyr Pro Val Arg Ser Ala Ala Pro His Pro Gly Asp Ile
820 825 830

Gly Asp Phe Ile Asn Glu Lys Thr Trp Pro Ile Gln Ser Leu His Leu
835 840 845



<210> 55 
<211> 103 
<212> PRT 
<213> Humanus 

<400> 55 

Met Glu Arg Val Lys Met Ile Asn Val Gln Arg Leu Leu Glu Ala Ala
1 5 10 15

Glu Phe Leu Glu Arg Arg Glu Arg Glu Cys Glu His Gly Tyr Ala Ser
20 25 30

Ser Phe Pro Ser Met Pro Ser Pro Arg Leu Gln His Ser Lys Pro Pro
35 40 45

Arg Arg Leu Ser Arg Ala Gln Lys His Ser Ser Gly Ser Ser Asn Thr
50 55 60

Ser Thr Ala Asn Arg Ser Thr His Asn Glu Leu Glu Lys Asn Arg Leu
65 70 75 80

Lys Asn Trp Leu Val Gly Arg Arg Asp Thr Arg Gly Met Lys Met Leu
85 90 95

Leu Lys Ala Ile Ala Val Ile
100 



<210> 56 
<211> 234 
<212> PRT 






<213> Humanus 



<400> 56 

Met Glu Lys His Ile Asn Thr Phe Leu Gln Asn Val Gln Ile Leu Leu
1 5 10 15

Glu Ala Ala Ser Tyr Leu Glu Gln Ile Glu Lys Glu Asn Lys Lys Cys
20 25 30

Glu His Gly Tyr Ala Ser Ser Phe Pro Ser Met Pro Ser Pro Arg Leu
35 40 45

Gln His Ser Lys Pro Pro Arg Arg Leu Ser Arg Ala Gln Lys His Ser
50 55 60

Ser Gly Ser Ser Asn Thr Ser Thr Ala Asn Arg Ser Thr His Asn Glu
65 70 75 80

Leu Glu Lys Asn Arg Arg Ala His Leu Arg Leu Cys Leu Glu Arg Leu
85 90 95

Lys Val Leu Ile Pro Leu Gly Pro Asp Cys Thr Arg His Thr Thr Leu
100 105 110

Gly Leu Leu Asn Lys Ala Lys Ala His Ile Lys Lys Leu Glu Glu Ala
115 120 125

Glu Arg Lys Ser Gln His Gln Leu Glu Asn Leu Glu Arg Glu Gln Arg
130 135 140

Phe Leu Lys Trp Arg Leu Glu Gln Leu Gln Gly Pro Gln Glu Met Glu
145 150 155 160

Arg Ile Arg Met Asp Ser Ile Gly Ser Thr Ile Ser Ser Asp Arg Ser
165 170 175

Asp Ser Glu Arg Glu Glu Ile Glu Val Asp Val Glu Ser Thr Glu Phe
180 185 190

Ser His Gly Glu Val Asp Asn Ile Ser Thr Thr Ser Ile Ser Asp Ile
195 200 205

Asp Asp His Ser Ser Leu Pro Ser Ile Gly Ser Asp Glu Gly Tyr Ser
210 215 220 

Ser Ala Ser Val Lys Leu Ser Phe Thr Ser 
225 230 



<210> 57 
<211> 329 
<212> PRT 
<213> Humanus 

<400> 57 

Met Glu Ser Pro Ala Ser Ser Gln Pro Ala Ser Met Pro Gln Ser Lys
1 5 10 15

Gly Lys Ser Lys Arg Lys Lys Asp Leu Arg Ile Ser Cys Met Ser Lys
20 25 30

Pro Pro Ala Pro Asn Pro Thr Pro Pro Arg Asn Leu Asp Ser Arg Thr
35 40 45

Phe Ile Thr Ile Gly Asp Arg Asn Phe Glu Val Glu Ala Asp Asp Leu
50 55 60

Val Thr Ile Ser Glu Leu Gly Arg Gly Ala Tyr Gly Val Val Glu Lys
65 70 75 80

Val Arg His Ala Gln Ser Gly Thr Ile Met Ala Val Lys Arg Ile Arg
85 90 95

Ala Thr Val Asn Ser Gln Glu Gln Lys Arg Leu Leu Met Asp Leu Asp
100 105 110

Ile Asn Met Arg Thr Val Asp Cys Phe Tyr Thr Val Thr Phe Tyr Gly
115 120 125

Ala Leu Phe Arg Glu Gly Asp Val Trp Ile Cys Met Glu Leu Met Asp
130 135 140

Thr Ser Leu Asp Lys Phe Tyr Arg Lys Val Leu Asp Lys Asn Met Thr
145 150 155 160

Ile Pro Glu Asp Ile Leu Gly Glu Ile Ala Val Ser Ile Val Arg Ala
165 170 175

Leu Glu His Leu His Ser Lys Leu Ser Val Ile His Arg Asp Val Lys
180 185 190

Pro Ser Asn Val Leu Ile Asn Lys Glu Gly His Val Lys Met Cys Asp
195 200 205

Phe Gly Ile Ser Gly Tyr Leu Val Asp Ser Val Ala Lys Thr Met Asp
210 215 220

Ala Gly Cys Lys Pro Tyr Met Ala Pro Glu Arg Ile Asn Pro Glu Leu
225 230 235 240

Asn Gln Lys Gly Tyr Asn Val Lys Ser Asp Val Trp Ser Leu Gly Ile
245 250 255

Thr Met Ile Glu Met Ala Ile Leu Arg Phe Pro Tyr Glu Ser Trp Gly
260 265 270

Thr Pro Phe Gln Gln Leu Lys Gln Val Val Glu Glu Pro Ser Pro Gln
275 280 285

Leu Pro Ala Asp Arg Phe Ser Pro Glu Phe Val Asp Phe Thr Ala Gln
290 295 300

Cys Leu Arg Lys Asn Pro Ala Glu Arg Met Ser Tyr Leu Glu Leu Ile
305 310 315 320

Gly Ala Asp Arg Phe Ser Pro Thr Pro
325 



<210> 58 



<211> 292 
<212> PRT 
<213> Humanus 



<400> 58 

Met Pro Glu Ile Arg Leu Arg His Val Val Ser Cys Ser Ser Gln Asp
1 5 10 15

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys
20 25 30

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr Ile Ser Val Val Leu Gln
35 40 45

Leu Glu Lys Glu Glu Gln Ile His Ser Val Asp Ile Gly Asn Asp Gly
50 55 60

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly
65 70 75 80

Glu Gln Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro
85 90 95

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly
100 105 110

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val
115 120 125

Lys Ile Val Cys Ser Gln Pro Tyr Ser Lys Asp Ser Pro Phe Gly Leu
130 135 140

Ser Phe Val Arg Phe His Ser Pro Pro Asp Lys Asp Glu Ala Glu Ala
145 150 155 160

Pro Ser Gln Lys Val Thr Val Thr Lys Leu Gly Gln Phe Arg Val Lys
165 170 175

Glu Glu Asp Glu Ser Ala Asn Ser Leu Arg Pro Gly Ala Leu Phe Phe
180 185 190

Ser Arg Ile Asn Lys Thr Ser Pro Val Thr Ala Ser Asp Pro Ala Gly
195 200 205

Pro Ser Tyr Ala Ala Ala Thr Leu Gln Ala Ser Ser Ala Ala Ser Ser
210 215 220

Ala Ser Pro Val Ser Arg Ala Ile Gly Ser Thr Ser Lys Pro Gln Glu
225 230 235 240 

Ser Pro Trp His Ser Phe Val Pro Asp Gly Ser Thr Val Ala Met Arg 
245 250 255 

Ser Arg Ser Tyr Phe Leu Thr Ser Ser Met Gly Trp Cys Arg Lys Pro 
260 265 270 

Glu Val Cys Ala Ile His Thr His Thr His Thr His Thr His Thr His
275 280 285

Thr Arg Cys Ile
290 



<210> 59 
<211> 266 
<212> PRT 
<213> Humanus 

<400> 59 

Met Pro Glu Ile Arg Leu Arg His Val Val Ser Cys Ser Ser Gln Asp
1 5 10 15

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys
20 25 30

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr Ile Ser Val Val Leu Gln
35 40 45

Leu Glu Lys Glu Glu Gln Ile His Ser Val Asp Ile Gly Asn Asp Gly
50 55 60

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly
65 70 75 80

Glu Gln Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro
85 90 95

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly
100 105 110

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val
115 120 125

Lys Ile Val Cys Ser Gln Pro Tyr Ser Lys Asp Ser Pro Phe Gly Leu
130 135 140

Ser Phe Val Arg Phe His Ser Pro Pro Asp Lys Asp Glu Ala Glu Ala
145 150 155 160

Pro Ser Gln Lys Val Thr Val Thr Lys Leu Gly Gln Phe Arg Val Lys
165 170 175

Glu Glu Asp Glu Ser Ala Asn Ser Leu Arg Pro Gly Ala Leu Phe Phe
180 185 190

Ser Arg Ile Asn Lys Thr Ser Pro Val Thr Ala Ser Asp Pro Ala Gly
195 200 205

Pro Ser Tyr Ala Ala Ala Thr Leu Gln Ala Ser Ser Ala Ala Ser Ser
210 215 220

Ala Ser Pro Val Ser Arg Ala Ile Gly Ser Thr Ser Lys Pro Gln Glu
225 230 235 240

Ser Ser Asp Phe Gly Gly Val Glu Glu Glu Arg Ser Trp Arg Pro Gln
245 250 255

Ser Ile Pro Ile Pro Ser Ala Pro Gly Ser
260 265 



<210> 60 
<211> 247 
<212> PRT 
<213> Humanus 

<400> 60 

Met Pro Glu Ile Arg Leu Arg His Val Val Ser Cys Ser Ser Gln Asp
1 5 10 15

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys
20 25 30

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr Ile Ser Val Val Leu Gln
35 40 45

Leu Glu Lys Glu Glu Gln Ile His Ser Val Asp Ile Gly Asn Asp Gly
50 55 60

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly
65 70 75 80

Glu Gln Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro
85 90 95

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly
100 105 110

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val
115 120 125

Lys Ile Val Cys Ser Gln Pro Tyr Ser Lys Asp Ser Pro Phe Gly Leu
130 135 140

Ser Phe Val Arg Phe His Ser Pro Pro Asp Lys Asp Glu Ala Glu Ala
145 150 155 160

Pro Ser Gln Lys Val Thr Val Thr Lys Leu Gly Gln Phe Arg Val Lys
165 170 175

Glu Glu Asp Glu Ser Ala Asn Ser Leu Arg Leu Glu Asp Tyr Met Ser
180 185 190

Asp Arg Val Gln Phe Val Ile Thr Ala Gln Glu Trp Asp Pro Ser Phe
195 200 205

Glu Glu Ala Leu Met Asp Asn Pro Ser Leu Ala Phe Val Arg Pro Arg
210 215 220

Trp Ile Tyr Ser Cys Asn Glu Lys Gln Lys Leu Leu Pro His Gln Leu
225 230 235 240

Tyr Gly Val Val Pro Gln Ala
245 



<210> 61 
<211> 624 
<212> PRT 
<213> Humanus 



<400> 61 

Met Pro Glu Ile Arg Leu Arg His Val Val Ser Cys Ser Ser Gln Asp
1 5 10 15

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys
20 25 30

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr Ile Ser Val Val Leu Gln
35 40 45

Leu Glu Lys Glu Glu Gln Ile His Ser Val Asp Ile Gly Asn Asp Gly
50 55 60

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly
65 70 75 80

Glu Gln Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro
85 90 95

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly
100 105 110

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val
115 120 125

Lys Ile Val Cys Ser Gln Pro Tyr Ser Lys Asp Ser Pro Phe Gly Leu
130 135 140

Ser Phe Val Arg Phe His Ser Pro Pro Asp Lys Asp Glu Ala Glu Ala
145 150 155 160

Pro Ser Gln Lys Val Thr Val Thr Lys Leu Gly Gln Phe Arg Val Lys
165 170 175

Glu Glu Asp Glu Ser Ala Asn Ser Leu Arg Pro Gly Ala Leu Phe Phe
180 185 190

Ser Arg Ile Asn Lys Thr Ser Pro Val Thr Ala Ser Asp Pro Ala Gly
195 200 205

Pro Ser Tyr Ala Ala Ala Thr Leu Gln Ala Ser Ser Ala Ala Ser Ser
210 215 220

Ala Ser Pro Val Ser Arg Ala Ile Gly Ser Thr Ser Lys Pro Gln Glu
225 230 235 240

Ser Pro Lys Gly Lys Arg Lys Leu Asp Leu Asn Gln Glu Glu Lys Lys
245 250 255

Thr Pro Ser Lys Pro Pro Ala Gln Leu Ser Pro Ser Val Pro Lys Arg
260 265 270

Pro Lys Leu Pro Ala Pro Thr Arg Thr Pro Ala Thr Ala Pro Val Pro
275 280 285

Ala Arg Ala Gln Gly Ala Val Thr Gly Lys Pro Arg Gly Glu Gly Thr
290 295 300

Glu Pro Arg Arg Pro Arg Ala Gly Pro Glu Glu Leu Gly Lys Ile Leu
305 310 315 320 



Gln Gly Val Val Val Val Leu Ser Gly Phe Gln Asn Pro Phe Arg Ser
325 330 335

Glu Leu Arg Asp Lys Ala Leu Glu Leu Gly Ala Lys Tyr Arg Pro Asp
340 345 350

Trp Thr Arg Asp Ser Thr His Leu Ile Cys Ala Phe Ala Asn Thr Pro
355 360 365

Lys Tyr Ser Gln Val Leu Gly Leu Gly Gly Arg Ile Val Arg Lys Glu
370 375 380

Trp Val Leu Asp Cys His Arg Met Arg Arg Arg Leu Pro Ser Arg Arg
385 390 395 400

Tyr Leu Met Ala Gly Pro Gly Ser Ser Ser Glu Glu Asp Glu Ala Ser
405 410 415

His Ser Gly Gly Ser Gly Asp Glu Ala Pro Lys Leu Pro Gln Lys Gln
420 425 430

Pro Gln Thr Lys Thr Lys Pro Thr Gln Ala Ala Gly Pro Ser Ser Pro
435 440 445

Gln Lys Pro Pro Thr Pro Glu Glu Thr Lys Ala Ala Ser Pro Val Leu
450 455 460

Gln Glu Asp Ile Asp Ile Glu Gly Val Gln Ser Glu Gly Gln Asp Asn
465 470 475 480

Gly Ala Glu Asp Ser Gly Asp Thr Glu Asp Glu Leu Arg Arg Val Ala
485 490 495

Glu Gln Lys Glu His Arg Leu Pro Pro Gly Gln Glu Glu Asn Gly Glu
500 505 510

Asp Pro Tyr Ala Gly Ser Thr Asp Glu Asn Thr Asp Ser Glu Glu His
515 520 525

Gln Glu Pro Pro Asp Leu Pro Val Pro Glu Leu Pro Arg Phe Leu Pro
530 535 540

Gly Gln Ala Leu Leu Ser Leu Arg Gly Val Pro Trp Gly Arg Ala Ala
545 550 555 560

Glu Thr His Pro Ile Arg His Ser Leu Gln Trp Gly Ala Pro Trp His
565 570 575

Ser Phe Val Pro Asp Gly Ser Thr Val Ala Met Arg Ser Arg Ser Tyr
580 585 590

Phe Leu Thr Ser Ser Met Gly Trp Cys Arg Lys Pro Glu Val Cys Ala
595 600 605

Ile His Thr His Thr His Thr His Thr His Thr His Thr Arg Cys Ile
610 615 620 



<210> 62 
<211> 567 
<212> PRT 
<213> Humanus 



<400> 62 

Met Ala Gly Ala Ile Ala Ser Arg Met Ser Phe Ser Ser Leu Lys Arg
1 5 10 15

Lys Gln Pro Lys Thr Phe Thr Val Arg Ile Val Thr Met Asp Ala Glu
20 25 30

Met Glu Phe Asn Cys Glu Met Lys Trp Lys Gly Lys Asp Leu Phe Asp
35 40 45

Leu Val Cys Arg Thr Leu Gly Leu Arg Glu Thr Trp Phe Phe Gly Leu
50 55 60

Gln Tyr Thr Ile Lys Asp Thr Val Ala Trp Leu Lys Met Asp Lys Lys
65 70 75 80

Val Leu Asp His Asp Val Ser Lys Glu Glu Pro Val Thr Phe His Phe
85 90 95

Leu Ala Lys Phe Tyr Pro Glu Asn Ala Glu Glu Glu Leu Val Gln Glu
100 105 110

Ile Thr Gln His Leu Phe Phe Leu Gln Val Lys Lys Gln Ile Leu Asp
115 120 125

Glu Lys Ile Tyr Cys Pro Pro Glu Ala Ser Val Leu Leu Ala Ser Tyr
130 135 140

Ala Val Gln Ala Lys Tyr Gly Asp Tyr Asp Pro Ser Val His Lys Arg
145 150 155 160

Gly Phe Leu Ala Gln Glu Glu Leu Leu Pro Lys Arg Val Ile Asn Leu
165 170 175

Tyr Gln Met Thr Pro Glu Met Trp Glu Glu Arg Ile Thr Ala Trp Tyr
180 185 190

Ala Glu His Arg Gly Arg Ala Arg Asp Glu Ala Glu Met Glu Tyr Leu
195 200 205

Lys Ile Ala Gln Asp Leu Glu Met Tyr Gly Val Asn Tyr Phe Ala Ile
210 215 220

Arg Asn Lys Lys Gly Thr Glu Leu Leu Leu Gly Val Asp Ala Leu Gly
225 230 235 240

Leu His Ile Tyr Asp Pro Glu Asn Arg Leu Thr Pro Lys Ile Ser Phe
245 250 255

Pro Trp Lys Asn Glu Ile Arg Asn Ile Ser Tyr Ser Asp Lys Glu Phe
260 265 270

Thr Ile Lys Pro Leu Asp Lys Lys Ile Asp Val Phe Lys Phe Asn Ser
275 280 285 



Ser Lys Leu Arg Val Asn Lys Leu Ile Leu Gln Leu Cys Ile Gly Asn
290 295 300

His Asp Leu Phe Met Arg Arg Arg Lys Ala Asp Ser Leu Glu Val Gln
305 310 315 320

Gln Met Lys Ala Gln Ala Arg Glu Glu Lys Ala Arg Lys Gln Met Lys
325 330 335

Glu Glu Ala Thr Met Ala Asn Glu Ala Leu Met Arg Ser Glu Glu Thr
340 345 350

Ala Asp Leu Leu Ala Glu Lys Ala Gln Ile Thr Glu Glu Glu Ala Lys
355 360 365

Leu Leu Ala Gln Lys Ala Ala Glu Ala Glu Gln Glu Met Gln Arg Ile
370 375 380

Lys Ala Thr Ala Ile Arg Thr Glu Glu Glu Lys Arg Leu Met Glu Gln
385 390 395 400

Lys Val Leu Glu Ala Glu Val Leu Ala Leu Lys Met Ala Glu Glu Ser
405 410 415

Glu Arg Arg Ala Lys Glu Ala Asp Gln Leu Lys Gln Asp Leu Gln Glu
420 425 430

Ala Arg Glu Ala Glu Arg Arg Ala Lys Gln Lys Leu Leu Glu Ile Ala
435 440 445

Thr Lys Pro Thr Tyr Pro Pro Met Asn Pro Ile Pro Ala Pro Leu Pro
450 455 460

Pro Asp Ile Pro Ser Phe Asn Leu Ile Gly Asp Ser Leu Ser Phe Asp
465 470 475 480

Phe Lys Asp Thr Asp Met Lys Arg Leu Ser Met Glu Ile Glu Lys Glu
485 490 495

Lys Val Glu Tyr Met Glu Lys Ser Lys His Leu Gln Glu Gln Leu Asn
500 505 510

Glu Leu Lys Thr Glu Ile Glu Ala Leu Lys Leu Lys Glu Arg Glu Thr
515 520 525

Ala Leu Asp Ile Leu His Asn Glu Asn Ser Asp Arg Gly Gly Ser Ser
530 535 540

Lys His Asn Thr Ile Lys Lys Leu Thr Leu Gln Ser Ala Lys Ser Arg
545 550 555 560 

Val Ala Phe Phe Glu Glu Leu 
565 



<210> 63 
<211> 134 
<212> PRT 
<213> Humanus 



<400> 63 

Met Arg Glu Arg Phe Asp Arg Phe Leu His Glu Lys Asn Cys Met Thr
1 5 10 15

Asp Leu Leu Ala Lys Leu Glu Ala Lys Thr Gly Val Asn Arg Ser Phe
20 25 30

Ile Ala Leu Gly Val Ile Gly Leu Val Ala Leu Tyr Leu Val Phe Gly
35 40 45

Tyr Gly Ala Ser Leu Leu Cys Asn Leu Ile Gly Phe Gly Tyr Pro Ala
50 55 60

Tyr Ile Ser Ile Lys Ala Ile Glu Ser Pro Asn Lys Glu Asp Asp Thr
65 70 75 80

Gln Trp Leu Thr Tyr Trp Val Val Tyr Gly Val Phe Ser Ile Ala Glu
85 90 95

Phe Phe Ser Asp Ile Phe Leu Ser Trp Phe Pro Phe Tyr Tyr Met Leu
100 105 110

Lys Gln Ile Tyr Leu Glu Pro Pro Cys Ala Arg Phe Cys Ser Thr Ser
115 120 125

Gly Arg Tyr Phe Gly Glu
130



<210> 64 
<211> 1278 
<212> PRT 
<213> Humanus 

<400> 64 

Met Asp Leu Glu Gly Asp Arg Asn Gly Gly Ala Lys Lys Lys Asn Phe
1 5 10 15

Phe Lys Leu Asn Asn Lys Ser Glu Lys Asp Lys Lys Glu Lys Lys Pro
20 25 30

Thr Val Ser Val Phe Ser Met Phe Arg Tyr Ser Asn Trp Leu Asp Lys
35 40 45

Leu Tyr Met Val Val Gly Thr Leu Ala Ala Ile Ile His Gly Ala Gly
50 55 60

Leu Pro Leu Met Met Leu Val Phe Gly Glu Met Thr Asp Ile Phe Ala
65 70 75 80

Asn Ala Gly Asn Leu Glu Asp Leu Met Ser Asn Ile Thr Asn Arg Ser
85 90 95

Asp Ile Asn Asp Thr Gly Phe Phe Met Asn Leu Glu Glu Asp Met Thr
100 105 110

Arg Tyr Ala Tyr Tyr Tyr Ser Gly Ile Gly Ala Gly Val Leu Val Ala
115 120 125



Ala Tyr Ile Gln Val Ser Phe Trp Cys Leu Ala Ala Gly Arg Gln Ile
130 135 140

His Lys Ile Arg Lys Gln Phe Phe His Ala Ile Met Arg Gln Glu Ile
145 150 155 160

Gly Trp Phe Asp Val His Asp Val Gly Glu Leu Asn Thr Arg Leu Thr
165 170 175

Asp Asp Val Ser Lys Ile Asn Glu Val Ile Gly Asp Lys Ile Gly Met
180 185 190

Phe Phe Gln Ser Met Ala Thr Phe Phe Thr Gly Phe Ile Val Gly Phe
195 200 205

Thr Arg Gly Trp Lys Leu Thr Leu Val Ile Leu Ala Ile Ser Pro Val
210 215 220

Leu Gly Leu Ser Ala Ala Val Trp Ala Lys Ile Leu Ser Ser Phe Thr
225 230 235 240

Asp Lys Glu Leu Leu Ala Tyr Ala Lys Ala Gly Ala Val Ala Glu Glu
245 250 255

Val Leu Ala Ala Ile Arg Thr Val Ile Ala Phe Gly Gly Gln Lys Lys
260 265 270

Glu Leu Glu Arg Tyr Asn Lys Asn Leu Glu Glu Ala Lys Arg Ile Gly
275 280 285

Ile Lys Lys Ala Ile Thr Ala Asn Ile Ser Ile Gly Ala Ala Phe Leu
290 295 300

Leu Ile Tyr Ala Ser Tyr Ala Leu Ala Phe Trp Tyr Gly Thr Thr Leu
305 310 315 320

Val Leu Ser Gly Glu Tyr Ser Ile Gly Gln Val Leu Thr Val Phe Phe
325 330 335

Ser Val Leu Ile Gly Ala Phe Ser Val Gly Gln Ala Ser Pro Ser Ile
340 345 350

Glu Ala Phe Ala Asn Ala Arg Gly Ala Ala Tyr Glu Ile Phe Lys Ile
355 360 365

Ile Asp Asn Lys Pro Ser Ile Asp Ser Tyr Ser Lys Ser Gly His Lys
370 375 380

Pro Asp Asn Ile Lys Gly Asn Leu Glu Phe Arg Asn Val His Phe Ser
385 390 395 400

Tyr Pro Ser Arg Lys Glu Val Lys Ile Leu Lys Gly Leu Asn Leu Lys
405 410 415

Val Gln Ser Gly Gln Thr Val Ala Leu Val Gly Asn Ser Gly Cys Gly
420 425 430

Lys Ser Thr Thr Val Gln Leu Met Gln Arg Leu Tyr Asp Pro Thr Glu
435 440 445 



Gly Met Val Ser Val Asp Gly Gln Asp Ile Arg Thr Ile Asn Val Arg
450 455 460

Phe Leu Arg Glu Ile Ile Gly Val Val Ser Gln Glu Pro Val Leu Phe
465 470 475 480

Ala Thr Thr Ile Ala Glu Asn Ile Arg Tyr Gly Arg Glu Asn Val Thr
485 490 495

Met Asp Glu Ile Glu Lys Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe
500 505 510

Ile Met Lys Leu Pro His Lys Phe Asp Thr Leu Val Gly Glu Arg Gly
515 520 525

Ala Gln Leu Ser Gly Gly Gln Lys Gln Arg Ile Ala Ile Ala Arg Ala
530 535 540

Leu Val Arg Asn Pro Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala
545 550 555 560

Leu Asp Thr Glu Ser Glu Ala Val Val Gln Val Ala Leu Asp Lys Ala
565 570 575

Arg Lys Gly Arg Thr Thr Ile Val Ile Ala His Arg Leu Ser Thr Val
580 585 590

Arg Asn Ala Asp Val Ile Ala Gly Phe Asp Asp Gly Val Ile Val Glu
595 600 605

Lys Gly Asn His Asp Glu Leu Met Lys Glu Lys Gly Ile Tyr Phe Lys
610 615 620

Leu Val Thr Met Gln Thr Ala Gly Asn Glu Val Glu Leu Glu Asn Ala
625 630 635 640

Ala Asp Glu Ser Lys Ser Glu Ile Asp Ala Leu Glu Met Ser Ser Asn
645 650 655

Asp Ser Arg Ser Ser Leu Ile Arg Lys Arg Ser Thr Arg Arg Ser Val
660 665 670

Arg Gly Ser Gln Ala Gln Asp Arg Lys Leu Ser Thr Lys Glu Ala Leu
675 680 685

Asp Glu Ser Ile Pro Pro Val Ser Phe Trp Arg Ile Met Lys Leu Asn
690 695 700

Leu Thr Glu Trp Pro Tyr Phe Val Val Gly Val Phe Cys Ala Ile Ile
705 710 715 720

Asn Gly Gly Leu Gln Pro Ala Phe Ala Ile Ile Phe Ser Lys Ile Ile
725 730 735

Gly Val Phe Thr Arg Ile Asp Asp Pro Glu Thr Lys Arg Gln Asn Ser
740 745 750

Asn Leu Phe Ser Leu Leu Phe Leu Ala Leu Gly Ile Ile Ser Phe Ile
755 760 765

Thr Phe Phe Leu Gln Gly Phe Thr Phe Gly Lys Ala Gly Glu Ile Leu
770 775 780 

Thr Lys Arg Leu Arg Tyr Met Val Phe Arg Ser Met Leu Arg Gln Asp
785 790 795 800

Val Ser Trp Phe Asp Asp Pro Lys Asn Thr Thr Gly Ala Leu Thr Thr
805 810 815

Arg Leu Ala Asn Asp Ala Ala Gln Val Lys Gly Ala Ile Gly Ser Arg
820 825 830

Leu Ala Val Ile Thr Gln Asn Ile Ala Asn Leu Gly Thr Gly Ile Ile
835 840 845

Ile Ser Phe Ile Tyr Gly Trp Gln Leu Thr Leu Leu Leu Leu Ala Ile
850 855 860

Val Pro Ile Ile Ala Ile Ala Gly Val Val Glu Met Lys Met Leu Ser
865 870 875 880

Gly Gln Ala Leu Lys Asp Lys Lys Glu Leu Glu Gly Ala Gly Lys Ile
885 890 895

Ala Thr Glu Ala Ile Glu Asn Phe Arg Thr Val Val Ser Leu Thr Gln
900 905 910

Glu Gln Lys Phe Glu His Met Tyr Ala Gln Ser Leu Gln Val Pro Tyr
915 920 925

Arg Asn Ser Leu Arg Lys Ala His Ile Phe Gly Ile Thr Phe Ser Phe
930 935 940

Thr Gln Ala Met Met Tyr Phe Ser Tyr Ala Gly Cys Phe Arg Phe Gly
945 950 955 960

Ala Tyr Leu Val Ala His Lys Leu Met Ser Phe Glu Asp Val Leu Leu
965 970 975

Val Phe Ser Ala Val Val Phe Gly Ala Met Ala Val Gly Gln Val Ser
980 985 990

Ser Phe Ala Pro Asp Tyr Ala Lys Ala Lys Ile Ser Ala Ala His Ile
995 1000 1005

Ile Met Ile Ile Glu Lys Thr Pro Leu Ile Asp Ser Tyr Ser Thr Glu
1010 1015 1020

Gly Leu Met Pro Asn Thr Leu Glu Gly Asn Val Thr Phe Gly Glu Val
1025 1030 1035 1040

Val Phe Asn Tyr Pro Thr Arg Pro Asp Ile Pro Val Leu Gln Gly Leu
1045 1050 1055

Ser Leu Glu Val Lys Lys Gly Gln Thr Leu Ala Leu Val Gly Ser Ser
1060 1065 1070

Gly Cys Gly Lys Ser Thr Val Val Gln Leu Leu Glu Arg Phe Tyr Asp
1075 1080 1085

Pro Leu Ala Gly Lys Val Leu Leu Asp Gly Lys Glu Ile Lys Arg Leu
1090 1095 1100 

Asn Val Gln Trp Leu Arg Ala His Leu Gly Ile Val Ser Gln Glu Pro
1105 1110 1115 1120

Ile Leu Phe Asp Cys Ser Ile Ala Glu Asn Ile Ala Tyr Gly Asp Asn
1125 1130 1135

Ser Arg Val Val Ser Gln Glu Glu Ile Val Arg Ala Ala Lys Glu Ala
1140 1145 1150

Asn Ile His Ala Phe Ile Glu Ser Leu Pro Asn Lys Tyr Ser Thr Lys
1155 1160 1165

Val Gly Asp Lys Gly Thr Gln Leu Ser Gly Gly Gln Lys Gln Arg Ile
1170 1175 1180

Ala Ile Ala Arg Ala Leu Val Arg Gln Pro His Ile Leu Leu Leu Asp
1185 1190 1195 1200

Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val Gln Glu
1205 1210 1215

Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys Ile Val Ile Ala His
1220 1225 1230

Arg Leu Ser Thr Ile Gln Asn Ala Asp Leu Ile Val Val Phe Gln Asn
1235 1240 1245

Gly Arg Val Lys Glu His Gly Thr His Gln Gln Leu Leu Ala Gln Lys
1250 1255 1260

Gly Ile Tyr Phe Ser Met Val Ser Val Gln Ala Gly Thr Ile
1265 1270 1275 



<210> 65 

<211> 579

<212> PRT 

<213> Humanus 

<400> 65 

Met Asp Leu Glu Gly Asp Arg Asn Gly Gly Ala Lys Lys Lys Asn Phe
1 5 10 15

Phe Lys Leu Asn Asn Lys Ser Glu Lys Asp Lys Lys Glu Lys Lys Pro
20 25 30

Thr Val Ser Val Phe Ser Met Phe Arg Tyr Ser Asn Trp Leu Asp Lys
35 40 45

Leu Tyr Met Val Val Gly Thr Leu Ala Ala Ile Ile His Gly Ala Gly
50 55 60

Leu Pro Leu Met Met Leu Val Phe Gly Glu Met Thr Asp Ile Phe Ala
65 70 75 80

Asn Ala Gly Asn Leu Glu Asp Leu Met Ser Asn Ile Thr Asn Arg Ser
85 90 95

Asp Ile Asn Asp Thr Gly Phe Phe Met Asn Leu Glu Glu Asp Met Thr
100 105 110

Arg Tyr Ala Tyr Tyr Tyr Ser Gly Ile Gly Ala Gly Val Leu Val Ala
115 120 125

Ala Tyr Ile Gln Val Ser Phe Trp Cys Leu Ala Ala Gly Arg Gln Ile
130 135 140

His Lys Ile Arg Lys Gln Phe Phe His Ala Ile Met Arg Gln Glu Ile
145 150 155 160

Gly Trp Phe Asp Val His Asp Val Gly Glu Leu Asn Thr Arg Leu Thr
165 170 175

Asp Asp Val Ser Lys Ile Asn Glu Gly Ile Gly Asp Lys Ile Gly Met
180 185 190

Phe Phe Gln Ser Met Ala Thr Phe Phe Thr Gly Phe Ile Val Gly Phe
195 200 205

Thr Arg Gly Trp Lys Leu Thr Leu Val Ile Leu Ala Ile Ser Pro Val
210 215 220

Leu Gly Leu Ser Ala Ala Val Trp Ala Lys Ile Leu Ser Ser Phe Thr
225 230 235 240

Asp Lys Glu Leu Leu Ala Tyr Ala Lys Ala Gly Ala Val Ala Glu Glu
245 250 255

Val Leu Ala Ala Ile Arg Thr Val Ile Ala Phe Gly Gly Gln Lys Lys
260 265 270

Glu Leu Glu Arg Tyr Asn Lys Asn Leu Glu Glu Ala Lys Arg Ile Gly
275 280 285

Ile Lys Lys Ala Ile Thr Ala Asn Ile Ser Ile Gly Ala Ala Phe Leu
290 295 300

Leu Ile Tyr Ala Ser Tyr Ala Leu Ala Phe Trp Tyr Gly Thr Thr Leu
305 310 315 320

Val Leu Ser Gly Glu Tyr Ser Ile Gly Gln Val Leu Thr Val Phe Phe
325 330 335

Ser Val Leu Ile Gly Ala Phe Ser Val Gly Gln Ala Ser Pro Ser Ile
340 345 350

Glu Ala Phe Ala Asn Ala Arg Gly Ala Ala Tyr Glu Ile Phe Lys Ile
355 360 365

Ile Asp Asn Lys Pro Ser Ile Asp Ser Tyr Ser Lys Ser Gly His Lys
370 375 380

Pro Asp Asn Ile Lys Gly Asn Leu Glu Phe Arg Asn Val His Phe Ser
385 390 395 400

Tyr Pro Ser Arg Lys Glu Val Lys Ile Leu Lys Gly Leu Asn Leu Lys
405 410 415

Val Gln Ser Gly Gln Thr Val Ala Leu Val Gly Asn Ser Gly Cys Gly
420 425 430

Lys Ser Thr Thr Val Gln Leu Met Gln Arg Leu Tyr Asp Pro Thr Glu
435 440 445

Gly Met Val Ser Val Asp Gly Gln Asp Ile Arg Thr Ile Asn Val Arg
450 455 460

Phe Leu Arg Glu Ile Ile Gly Val Val Ser Gln Glu Pro Val Leu Phe
465 470 475 480

Ala Thr Thr Ile Ala Glu Asn Ile Arg Tyr Gly Arg Glu Asn Val Thr
485 490 495

Met Asp Glu Ile Glu Lys Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe
500 505 510

Ile Met Lys Leu Pro His Lys Phe Asp Thr Leu Val Gly Glu Arg Gly
515 520 525

Ala Gln Leu Ser Gly Gly Gln Lys Gln Arg Ile Ala Ile Ala Arg Ala
530 535 540

Leu Val Arg Asn Pro Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala
545 550 555 560

Leu Asp Thr Glu Ser Glu Ala Glu Val Gln Ala Ala Leu Asp Lys Val
565 570 575 

Ser Arg Leu 



<210> 66 
<211> 218 
<212> PRT 
<213> Humanus 

<400> 66

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu Ile Gly
1 5 10 15

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys Pro Ile
20 25 30

Gly Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala Ile Leu
35 40 45

Glu Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln Asn Gln
50 55 60

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val
65 70 75 80

Asn His Lys Asn Ile Ile Gly Leu Leu Asn Val Phe Thr Pro Gln Lys
85 90 95

Ser Leu Glu Glu Phe Gln Asp Val Tyr Ile Val Met Glu Leu Met Asp
100 105 110

Ala Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu Arg Met
115 120 125

Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly Ile Lys His Leu His Ser
130 135 140

Ala Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val Val Lys
145 150 155 160

Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg Thr Ala
165 170 175

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg
180 185 190

Ala Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Gly Gly Arg
195 200 205

Met Gly Lys Gly Ile Phe Thr Arg Leu Gln
210 215



<210> 67 
<211> 307 
<212> PRT 
<213> Humanus 

<400> 67 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu Ile Gly
1 5 10 15

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys Pro Ile
20 25 30

Gly Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala Ile Leu
35 40 45

Glu Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln Asn Gln
50 55 60

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val
65 70 75 80

Asn His Lys Asn Ile Ile Gly Leu Leu Asn Val Phe Thr Pro Gln Lys
85 90 95

Ser Leu Glu Glu Phe Gln Asp Val Tyr Ile Val Met Glu Leu Met Asp
100 105 110

Ala Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu Arg Met
115 120 125

Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly Ile Lys His Leu His Ser
130 135 140

Ala Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val Val Lys
145 150 155 160

Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg Thr Ala
165 170 175

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg
180 185 190



Ala Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu
195 200 205

Trp Ser Val Gly Cys Ile Met Gly Glu Met Val Cys His Lys Ile Leu
210 215 220

Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp Asn Lys Val Ile Glu Gln
225 230 235 240

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gln Pro Thr Val
245 250 255

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu
260 265 270

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys
275 280 285

Leu Lys Ala Ser Gln Tyr Phe Leu Gln Ile Cys Thr Phe Asn Pro Ile
290 295 300

Trp Gly Val
305



<210> 68 
<211> 339 
<212> PRT 
<213> Humanus 

<400> 68

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu Ile Gly
1 5 10 15

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys Pro Ile
20 25 30

Gly Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala Ile Leu
35 40 45

Glu Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln Asn Gln
50 55 60

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val
65 70 75 80

Asn His Lys Asn Ile Ile Gly Leu Leu Asn Val Phe Thr Pro Gln Lys
85 90 95

Ser Leu Glu Glu Phe Gln Asp Val Tyr Ile Val Met Glu Leu Met Asp
100 105 110

Ala Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu Arg Met
115 120 125

Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly Ile Lys His Leu His Ser
130 135 140

Ala Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val Val Lys
145 150 155 160

Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg Thr Ala
165 170 175

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg
180 185 190

Ala Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu
195 200 205

Trp Ser Val Gly Cys Ile Met Gly Glu Met Val Cys His Lys Ile Leu
210 215 220

Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp Asn Lys Val Ile Glu Gln
225 230 235 240

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gln Pro Thr Val
245 250 255

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu
260 265 270

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys
275 280 285

Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu Ser Lys Met Leu Val Ile
290 295 300

Asp Ala Ser Lys Arg Ile Ser Val Asp Glu Ala Leu Gln His Pro Tyr
305 310 315 320

Ile Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Arg Ser Cys Lys
325 330 335

Leu Phe Ser